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ABSTRACT* 


This  report  examines  some  aspects  of  the  problem  of  allocating 
resources  in  a  multiprogrammed  computer  system.  It  first  investigates 
to  what  extent  the  users  might  participate  in  resource  allocation  de¬ 
cisions;  a  system  that  dynamically  determines  the  prices  of  services 
is  advocated.  A  model  is  studied  which  yields  a  balanced  set  of  pro¬ 
grams  in  order  to  get  a  good  simultaneous  usage  of  the  available  sys¬ 
tem  s  resources.  It  also  examines  how  resource  utilization  figures 
can  affect  the  choice  of  equipment  to  be  used  at  a  computer  install¬ 
ation  and  the  choice  of  a  swapping  algorithm  at  system's  design  time. 


•This  report  reproduces  a  thesis  of  the  same  title  submitted  to  the 
Department  of  Electrical  Engineering,  Division  of  Computer  Science , 
University  of  Utah,  in  partial  fulfillment  of  the  requirements  for 
the  degree  of  Doctor  of  Philosophy. 
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CHAPTER  I 
INTRODUCTION 

I • 1  The  Role  of  Scheduling  or  Allocation  in  Mul tiprogrammed  Computer 

Systems . 

For  several  years,  computer  scientists  have  been  faced  with  the 
task  of  organizing  large  information  processing  systems  which  many 
users  may  access  simultaneously.  In  these  systems,  there  are  a  number 
of  physical  resources  (cells  in  core  memory,  peripherals,  ...).  At 
each  moment,  some  of  these  resources  are  allocated  to  some  users  (it 
is  implied  that  some  of  these  resources  are  not  allocated  to  anybody, 
some  others  to  the  system,  and  finally  some  of  them  to  many  simulta¬ 
neous  users — like  a  shared  program  segment) . 

What  are  the  specific  problems  for  these  large  systems? 

1)  PROTECTION:  A  user  must  be  prevented  from  accessing  a  re¬ 
source  which  is  not  allocated  to  him;  or,  equivalently  a  user  should 
"see"  only  the  resources  that  he  is  allowed  to  access.  Several  solu¬ 
tions  which  are  more  or  less  satisfying  have  been  proposed  in  the  last 
few  years,  detailed  accounts  of  which  can  be  found  in  (1). 

2)  DEADLY  EMBRACE  occurs  when  two  or  more  processes  are  mutually 
blocking  each  other,  in  that  each  of  them  is  demanding  a  resource  that 
another  posesses  and  doos  not  want  to  release.  The  problem  of  avoiding 
deadly  embrace  has  been  solved  satisfactorily,  for  instance  by  Haberman 
[2] ,  who  assumed  that  some  facts  could  be  known  about  a  user  (his  maxi¬ 
mum  demand  for  resources)  before  any  resources  were  allocated  to  him. 


} 


3)  The  scheduling  or  ALLOCATION  pioblen  itself  (to  whom  th-?  r«- 
sources  should  bo  allocated  if  there  in  any  conflirti  it  clearly  ****»■ 
rato  from  tin;  problem  of  protection  and  can  be  n;<m  letely  separated 
from  the  previous  problem,  because  deadly  embrace  is  only  lethal  for 
certain  kinds  of  demands  which  are  not  relevant  to  the  allocator  (es¬ 
sentially,  access  to  shared  tables  and  shared  files). 

Problem  S3  is  relevant  to  this  study. 

1 . 2  Evolution  of  the  Problem  of  Allocating  Computer  Sysf^t*  M»c  jr  . 

The  first  computers  were  run  in  a  batch-process  I  no  rode.  Thtre 
were  really  two  processes-- the  system  and  a  prooram  to  lie  run  by  a 
user.  If  either  of  then  asked  for  a  resource  which  could  not  b*»  allo¬ 
cated  (for  example,  too  much  core  memory),  the  user's  program  was  sim¬ 
ply  aborted,  and  the  next  one  loaded. 

Later  came  the  idea  of  time-slicing  the  utilization  of  the  entire 
set  of  main  resources  (core  memory,  CPU  snd  disk  I/O).  At  the  end  of 
a  time-slice,  a  user  would  be  deallocated,  and  another  user  allowed  to 
issue  requests  to  the  available  resources.  The  entire  system  could  be 
considered  as  ju3t  one  biq  resource.  Response  time  in  such  simple  sys¬ 
tems  could  be  studied  by  queuing  theory. 

Another  idea  was  not  only  to  partition  the  time  domain,  but  also 
the  space  of  the  resources.  One  user  night  have  the  right  to  use  10K 
of  core  and  half  of  the  CPU  time  while  another  might  get  SK  of  core, 
half  of  the  CPU  time  and  the  disk  I/O.  Spaco-sl icinq  could  be  done 
either  independently  or  concurrently  with  timo-ulicinq.  Aqain,  if  n 
user  should  ask  for  more  than  the  resources  which  he  was  allowed  to 
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t«  }  ft*  M  |»  M;»OH  Of**  9  *  $»*#««! |#  **»»>»*  «  tllUfM  )  »  « 

Sf  fH#»|  ft*#  ♦**!  |  *'»'**■»>*•»  #0*1*1 0*t  *  «'  !U  <0*** 

4*  (•«)<$* >«!**§  !«  «*«  ®f  4  e«tf  *1*'  #*w*t  •€  ei %**■*»  ?**■  #* 

i&*  COWfc  rl*t  lkw*4  #  !»>*«  ►  ,  91  «**.«**  #f  fftw  **»**■ 

«!♦**,  *i%  gwWM«  tl*l4  4*4  4*  **M  |»»i*  I'Wffc  «Wtf»  I  *  I*’*  s*'***  ?W 
will  trr.  Im  l»  itwt.**'  #**#«*♦*<*  ii*4  to# 

f**l  II1N*  js*f  *,  >%f  10  0>**  |*’|  #*  |*0  Ct  fc*llu*'!.*ft  lit  b  •♦!*»> 

l«*  li  wilt  ittif »ei*l If  is  ^sUJ»  fU*s  ,*  *5«i  Nl  A<t«,  #*  o 

f#»'UW(  ©f  Jil*  ^1  w»*  *f  f«W  VM*  rr%irit  will  **#*>+» 

Of  C«MfN.  IMl  C*tt4lf.  Mi»  4*  Of  ft  ■"»!***  »*<#  »l*>  *  •  f  «»»!»» 

t0tw**ls  f*jM#f»l«r*  let  *Mf#i  Ik*  •/*  0O0  1:4-  ht*i  *H  I  soil «  fl*14  #  n  * 
lti^*4ff  It* I S*o0  tMcwt  5*  *lf|lMM^«  ti'**r«  .*M  «©*#* 

K»*4 

fMk  45^*4;,  W*tlCft  ©WftOiSl**  kf  IW'*  «f**-’t_»li_*  *  Of  l*<«  *11/**!  .  ,*  t 
«4llCft  CAfl  <i*  M?4N,  rtftal  5'i®  *i?X  Iks  olio  ..'411000  0?4>»f  IlMt*.  *t« 
nut  cmtciM  u©tf  it  twif  of!**.  ^nr  1*  $«  «t*  <*v 


t, 

(Mttiy  fAf  l««*  plenty  ii*  iHi  **1*11*9  in«i«*4,  a 

i*  p*  *■***«•■♦!  mt*  «*9*UMty 

I*  4ttfw«'l«.  #*»«**  II  I*  V« **•«*«* l*  If  f*i»*  (M  *****  w,9 

It  4*4  f#*t4  If  AlltflUAA  4*  1*1  . 

¥f  ifyi*y  I#  4*v*m4**  i  ^  w»,»piwm  **f  ib*  eei***#!**** 

*«*..<***••,  ,*M  t>y  w  M*4e  IM  f*  illy  #¥4t  1*1*1*  f*f*f 

|,V  **»♦*,  4  f  fM4«*Hl*J*  #*11*1#  *5  I1**  W*MI*I,  54*4*  lir  IV  *  f?**  )•*)*• 

«t  *  *#*•»♦  aM**  4.1*  !■♦'**« ***«  .  I*  l<rf!  •  f  c#«« •  * 

««<(*  *V<  »**•*  4*  *‘j*  will  ft*Hf  *f*.  #*  4**  4*4  ttf*1  tfHA|*4  f  *  «♦♦*»* 

I*  5ft*lt  I"*"****'*.  4*  v*M  I#  4*  f»**«*4v4)t¥  |«4'M**4  If, 

f *fcMf«>flw4 *  ,  «***M*t.  4f*  |4*  *«  jf  ********  VM  Ai  *#*•  twf  All**..  II  I* 

***I**M«  51.41  I.  <*■-  »•*»**  4**««*  9«r  #tr  5#  lift  <«*<*  54«*  *llKlM*4  #1* 
(#**im»*»  yl«tj>*9  |t***r^*<f*«4.1*  |f  >*5  *1 ) ,<*f  5  ]*«  m  14  *trf**%>  «***#*  * 
9V  if f ft *14  «*»ie*i4«4  If  4Ht|44|*4  5  >***  I  *$•♦«•«»**•*  **  >4)4  4*»  **♦» 

iMt...  «f*ft4*4.*.4*»'i  f#t  >9  4  m-»in*  If  t4w  «**  *»f  ii*  ****eiii  m  H  is* 

4*1 »  U4*4  «1<**»  l*  a  ffltl  #*m4>«*  #4  *********  *Mffc  f**  **» 
4<4*y  #*l»f  *r-  »«fc  «*AA‘*»**  flw*i  !<*  **?»*«  mm  »l**»  ♦*I»W*’».*4  l*5«****» 
I***,  «*ftfrll«*"*.  Ill*  Mf4ll*»  fVI***. 

M» f  f«  4»'%  5!»«  W«*  Iff  *|<w  M*  *4rlf*  f»*  4  f**1Alf  AfM***  #4 
5*  f*  «4M4A«lfl*<1i  1 4*  Wf*  *f  *-s:S*  4*  fM"f>-**y,  **«*  |«Hf  »*• 

M*>»*f  *t  •  •  ••$  4*  ff*ftfr|*  *f  *4*1  **llt  I-**  *1*4*1  1  4*  *  * »  if 

»M  f**4  if  4  **<»*{*♦»*  f*rtftfll«t  fw.4|W  It.?). 

Is  f*%l*>*  (ft  !'-•  <’*♦♦»•**  ft*54«*4*  If  4f 4)*'*|4  *f  *«*£•#*«* 

At tl*f 4*  1*4  4l9Pv*i5Mi*  If  I  *f45*’f*  v|  ||  1#  I**  f-*r  »****4» 
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i.J.t  Joi>  grVfulim. 

Mfoioncotsi  fn4o$tn«l  ?'he4ullfW  HI 
TV^ry  of  fch*4«liM  |4 1 

Wn*t  of  t H*  of  *cte4ul»nn  H***  Von  4mc  in  ct«r*Uf*vs 

i¥s*»fdt  for  i^=  j-fiitN  *»f  n  jol»*  *nic*»  *to  9©  v  #«*  m  »  tin** 

©rvr  of  »V  mnp*  cm-  fee  kuMr  inyo***  ot  *tt*i*t*tyi.  tv  ©m****  of 
l ttoi  I*  !»«**  finite,  #©4  !M  9i©r  #f*»nc  by  4  *»vo©  j«4*  ir  4  <#i*n*  *hep 
I*  >©’»'*«4  in  +*<««.  !•  )b!i  *****  cy* !«•»  fiwowf  y.**sr*  (Mom#  yv  ***r 
*Vy-  *«*t®  lfc»*k  nn<«l .  :v  414  i«  to  fiM  4  »»•  i<  t*Ui  in  o*»iotk 

4*144  **  '1*1  tw  t«  v<»4l«  i  y*>»> «  MSirti  f  t  iw*9*oc*% 

17'  V*of**e  tin*  rt*M  I tf  4  VM»  m  tV  %y*y«m> 

'TV  fflAlom  ***  4t«  ,  I-**©)  l»i«HHrt«l  «*»**♦  **  WII-piM  o©lc* 

4#*-  4  11911*  019  4i  ,f«  f*|**rt'  9*  ""14ft*1!*#  *y«  !**'*.  t4«  of  )  >* 

I*  inflMI®.  ?"•**#  *f»IW  f  **•,♦  <*4  1%  914*.  ,*of  )#4  *4  »*■.*« 

tiffMh  W*  '*A«by*g  0»*t  94®  |4rt  *f*  ,y***4o4  I*  *11  ft,***  *1*1 1  *9  •*-  ♦♦  *r  • 
UflUlH  1**1  **<*«£*  (if  t***»*f  Cm  * .  n«*a|ll*f  tfi^lfll^ 

life***  0**©  *!<v4i«4,  TV  ?*'Tf  l*4ip.f»s*|  *  ;*»  f If *41  *'t*‘  -tr 

IfHtirtlir,  4*  yi*!4  ©****1  tvtwlt*. 

'TV  •>»***■  •05«B'»i*4f  9ft****  4«s9ft#tA‘r|tt**>  Ifc  4  Of<*9<*4'* 

444ly*t*.  I*  Ml  *i'9»*,*|<-»n  Of  9V  **»lf  W*Vp  **>*1**4*  I***  f**» 
toflirtnl.  **9*V1V  *♦*  l«  It*  *  cow  10.01  lij*  #f  *l4**t4f  Ifo 

4*4  Vv*fl*9t€  4  JrP*»*ft4*  *  IM  fit  «  f  *||"*  *V#'  llfl*  rr«t* 

9M*'»jfti  9V  *  rt?  1.  A  Iff  10*1  Of  •  1*1*  ll*n-|  CM  Of  *©  l**|« 

TV  fUn#  4%  9*  Iff  9*  W»lt«  M  0©4»l*  'vVfo  *lt  4*9*  IH>„  I  low 

»t*»9  ky  4  it^i'y  c*,  4  f**OM*t#  <%f*  1?  iwm.  ft  I*  Vt  I  9# 

i»*  fccw  *»tfi  to*  j* i-rM**  co^ft4  v  o**4  f «*  root *it v  coo  *»i*f  «*-v4«ti©*. 


1.3.2  Cur  iff,  i  i  atus  of  yrttemn*!  Theory. 

Tt»*»  scitt  dul  inn  of  tir«-*har*<i  emputor  tyiifM  has  Nrert  studied 
in  Uw  light  of  4iu«*ui n<|  theory.  ***vievs  of  sut.it  rwjhods  can  foe  found 
in  JO, 7|.  This  r  virh  is  not  satisfying  because  It  applies  only  to 
vary  ample  Host  of  ism  tin*  tV  syu*»  is  rrdur^l  to  just 

cee  f re*  opt ifol*  rwwrce  fa  Cttl  t»f  a  distl,  and  a  aimi#  *tr*n*ry  li^ 

W  ttouni  Wtini,  or  IH  tPofrormind  hatStirnonil  is  »t>»Jif4i  <V  w|^,s« 
lie*  of  lh»  * /*»,«**  to  A  u«ef  Matts  to  f«U  IlM  «ll^  f«*$Owf  ’«* 
fct  a  certain  uo«  j^nnd.  i *  cwpeted.  TVs^  at®  rtvifsilr 

of  mo  h*lp  1?*  P>4f  f-tWhM,  !«•■  40*4*  l.v  knl«<  Of  A  **«-''>*»*>  «0>*» 

fnrtef  sysla*  are  tIN*  4|f*»  of  to*-  lr*  *jwt  l*»»  luMwitilh  of  It* 

tttVt  ytrti  tiw  »f«»i  of  «V  ctU’». 

J»*i#  ta**t  urn  **A\r*4  4s**  ms.  i*»n»m*  tfcat  **  !J* 

*r*tt>S*  t,**»a  *lb»*f  *roi|4  ;..«  •***"<  +*■**,  rt«»t  1st  *  trtowt*  »!!#■ 

«ltl«s  at*Of  tt&n.  ?Ofi*  statistic  r*S  la  t«tT  f*  |t»  »*»l  'V* 

r«f  fg»*tm*  O*  A  sys***  t*««"t  Ml  If*  4  *"*l  O*  *»•■**  ,  <*+<4 

to  0»l*f*4««  4*1  IMf  1*1  I *M'4**t*  «4M«f  isr«f  rtlO-  f  t  A  I  PH  til  *• 

tl«Ol  fo*v*-ref.  •*  h***  4  fc4hrH»»f*e  ailffMi  4  *!**•' fllV  *>*.*•  t  #o,  *e*v» 

#* *<mf of  •■#»#■*  c*n»t*f1*H*1lf«  oiKM  late#  *te**  to  k  tfp  l* 
daPS®*®**. 


1.1.1  “*■ '  .i. 

r-*»  mi  i*  itto  f*mi#  «' a*#!.* 

lfe»  «*t»  **S  a  ofl.tls  coofi station  tf  *y«iM  |>  t'  it 

Ife®  f»«sil**  at**  raltrl  OAily  IS  tfi®  ti«»  f*t  *SI  ff<P  ♦ISiJ  1  tJpfw 

Vm*  Ik#  foe*tW  f  vs#**  *fl*tf  ■«  *  ‘A'tnl  it  *e> 


lit^a  that  it  U  lAiutrallv  io  th*ei  * i »•  off«et  of  rach  of 

th*n  for  &»vr f«*  -«i*  of  vjlut-a  of  tha  lit*  roal  daooar  of 

iiouUiioo  i>'  w  hi*  cesuel  of  ibo  m«*  to*!  «if*«  r#lanpw>*iiM 
u#v<S*r  an  oosutm**  #?4tt  of  rosMli*.  *o4  t«  Mt*in*«f*  r*t  th**«  results. 
H*f*  aiain.  U  h  Wlt«>v**|  that  «l*4li>!  f  i  *t***l4  Ip  of  Solo  »o 

40  a  Itftl  laf'Kit*  MM1  tattoo  VS|c»i  *#l|t  *l#lct«Vl*»  #%« 
m  omooik  «£.■«#•' I«  ot  ) ♦«*»# m  ,  i«w*  oil  to  «•»!  *  t*  *  »  «  *i»«lu  »  al*r*#« 

lt£*r  vMrf»  e*».  I>«  «M«  t  If*  ««f  «-o  «V  rluo  *  •  ■  ♦  I  *  *  •  !  *•  **f* 

Hkmt* . 


!.)■ *  ■—*  ■  'r\  ■-  ■ 

VF-tirfS  -4^r  *  «*|l*t  4i'Mltt«Vi<»  t  f#  f.**  tf*  »• 

Of  *  tftlf*  iS*  #»«*  Of  JY*’  fo**  Oi#t  4)1  *  1 4  »*  IV 

CfV  do  1*1.  f«Wt  **">**t  |1i*>  *  I*  I*  t%*  **  t*  If  «rttll£4* 

tt*0  f|*||.  M  Ot»li#.4ll*"0  **  ffc*  two**,  ;  l»«y|»  of 

**wt»y  (113.11)1.  t*»l«  ♦*'*od»rf  i*s  rwtoi4*»  t  1  *»•  “**!♦»  *1.’  .aM 

|>*  *tfe>4  I"'  O  V  .'tin*  |'4  Mllia.Ftli*  **  «"*••  hllf.  ***»  **•'« 

l*>rYlff>  4  *f*  *r,  ■#*»'*) -M  4  *>l  ■**  *>W*  I  )  Iff  .*■  «a<4  f«W  ■>'•*  V  ■«*♦!  0*4#  w  » W 
*!l*sf  U3>«  fOO*i  ■>  of  V  •'"’Wf  •»«'#'  »»*  *  *«  («  »*<  l  Of  O  "4  . 

4Hf*  I*  »4*s  |l*.t».  v’M'S  *  IS  I* 

i%  **  ■"  o*  SI. 4. 

ST>«  t04.|»  V.lfc  ***•  1  |Ot**f  4*10*  0*4*1*  I*  •%*«  t*«-y 
*  *-*■*•  »*t  i*  w»s-  of  *trtt*>«v*  swftAfo,*,*  *  lot  o«f.f*|f>  «r<  %*  *coo  '<•*♦11 

Oft  *>o  *»♦  of  'odfY  * ♦■*’«'  *«•-(  to  *«a  «*vt'  f  1'  -o  (*.Ht.  '■s***. 
f.wt  to  w*»4  *ir f  *1  lot  I*  4*0  at*  IV  t*»  f»*  V#l-4*»r**  **!* 


lb 


It  ifs  wortii^iile  to  briefly  rent  ion  a  very  irolftirrl 
token  by  A  leon  At  UCLA.  Absolutely  everythin*!  namut  the  |jr©*irim*e 
MHtviouf  It  eopfooeett  to  b»  intwn  An»1  *ynth«»iml  in  a  ttlfp«**l  or  Aftr. 
(hint  this  infetMtion.  Hovel  |l*>)  studies  nenutry  *M  >ifOv'»«nf  Alio- 
rttinn  fct  a  ftetfAfl  niwbi  in  a  wilt u  vy*»t*o.  'me  »*  a*» 
errofl?  of  bn*  a  ttnor  torhl  cf!l«lr*  hie  vn  r  ^»vi  «>  a! irn  f  r  a 
#lv*n  envito«Ainnt«  tut  the  m<*»|*  of  ororr *n»  t»*«wo  i*#a  eeys.i  'ifAt^l 
10  slh«  the  finJhvr  Of  4  **t  «f  tN^t*  V|ren»1  ****•«  *  »**»  r*  t« 

bo*  t. 


»*4  jta-V -ftL  LKlrfti 

f  Iwllrn  Ifost  U«l*  Ai«  *."**«  f***J>*si  <wT|jy  A  «*»«»  aMN||4  l'*t,l*'|» 
b«!«  IS  (b*  »*•»  &**»«<*  »t  I  >  Atlmn  4*  UliWtj  ?  .»*  U**n  (ft****  |  .  *  » 
Mn*  ibis  foe  oef»t«*e*«l«  H*te  MbM  lelly  t*,<  *fpr  »»»#»«• * s 
||  «**r  «aa  noA*  tfo*  •ft«ct»ie*»*  foi***lt.  m, 

21  it*  **n  site  tfoe  *f*t«m  ettn*  inform* i»  n  **»«•*!  bis  ewn  foe- 
ore** •  y,  *a»a  t*»**ii»|«-  it  «>t  *s#ffs  e^f-v*«n  im  of  **tir»**. 

«*sA  I**  l*  nt**  tl»  Hfo<»*ieoA. 

'be  snootti  only  'worry  Afoot**  c»nflie?A.  **4  try  to  ninimrr 

lfo#»n  to  jtfcwt  a  o»*t  ft  mo  n<Of)»lui»it  siw  v*nl«*  neMM,  It  Is 
M««*»*ry  to  itenei  u.n  by  Iw4im  a  (ei"^  If  a  v**r 

n#t*  of  a  reft  tin  revere*,  k*  m*  tr  $**f  n»i«i 

The  eoflp»*er  it  then  t^siA"!  I  a*  a  *>rt«i  of  f«»n«f  £>«„ 

A*  e.et*»  ^«^^nen, ,  the  ot«r«  144  f  or  sin  tie  r**«Nif  e  or  *rw«*  s«i  of 
r*e&MT*e«  for  a  f*rie4  »o*  ur*.  lb?  ryst**'  waM  itself  m  a 

tel  of  feivWM?  in  orler  to  t^Unite  it«  «*i  |»eftt  ifor  irwttve.  *  v» 


a 

of  lh«  4C?«  1*  4  Ultlsl.  T»u»  prices  of  t*w*i»rr«  wwM  h*  ♦l**t#fm«*f  I 
tl/nwiol'.v  liy  if**  *ye!e*,  *s  *  (w  uw  of  »H»  lo*-l  *^mv  of  *■><*• 

TV  ft<>£*tr*  i»  ln«r  <*-me  l»f ***M  V  **»«  of  I  m*«  • 

vt;*ch  ‘ii  r*M  4  l*w  #))>  4!lf«t  «f  ««5^(o-  »  |*v*  ♦»>** '  l¥  I  f 
«iw<f  4*4  4*M  If  Itl#  W«IIN»  4  IN*'  "»•**  *•  »fl*4»* 

*#«  *l«*4i«4  S*i  *  ♦  SI.  A-  ■**»-***■*  I*  ,#1*4*  *«  **-  »*«l  f#4  MywOMrMNp 

IV*  »*}  Of  4l  wv  I*  f-Mh  \»  H  N  ***«♦. 

SSI  4  *h*V  |  vA-iOfc.  .**  **  »  *•  *1  *  '*  *  I  >* 

Of  |JN*  »f  f  (VO*  *«*-fliJ't  ("***♦  4  *M  »i*>  «!•  |*  fr  «'»•*!  * 

I*  *•*  w*,  ##-***4  !  **  I  **-!•**#  P*  ’  ••  <**  «**  IW 

|)r*  iflioi*  I  I*  *»»«4  4  *4  *  *  »*  <t« f* .  V*»  !'.** 

IV*  f4«4!4»  vf  1*.«  |*t*4W,m**»  I"1  !■«  *»'»***-  I*  I1  *ll**t  *: 

?*#  M  ***♦■»*  £*»**♦**’*  I*  '»*«♦¥  |  f.*M  If  II  I*  '  *  1 1 m 

l»*  *fr*f ♦“f  If  W**  «vi#itf4.  I  *  ♦••’■  lnl** 

i t*1*  1*0  f^trll  »•*<.  If  IV*  |*f '•♦■ft-** l**>  *Njn*l  »♦*  4Mm*I  *t  S»  *  v> 
*l*@4*.  II  fO*!  ;**  of  *>*“»  4*1*.  I  »  f***'l**.  4*4  S  IM»ol|!*.4l*4 

|{M|<  *<w»4|*<t  44l!»«'iftiHI0*l  4*^4-  I.  V|,!S  |W  *■■'*'**.*1  O  of 

*  ♦f  Or**’*  *lv*tir-fl*  «Hfc«**r  *  I'**1!  *1  fowl  f  «*•* 

tllf  Ifc  t»T*|  m«**  Oft  f  l»'»0  fo*fl*Ntl*»  -T^-  *“*  *.■»*'«•  *>M»*  COWfOlIf** 
j<«iV«t  nol  i  S**  v  •  *  ?*>»  0*»*H{|* ». 

(Q0+  1*f  l*  ♦  nMft***  V-5v>#  4*  *t*!»4*«  »►  *-t*l  liV",  |V*  f*ti  '*  t  Of 
*4»I*N**  HMt  «f  V^-ty  4*4  fVO,**  I  4«tl'**  <40  Ol‘  i?i  «**•  fVl***  of 
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authors  of  schedulers  with  multilevel  priority  queues  always  con¬ 
sidered  the  user  as  passive  or  inert:  all  users  are  equal,  and  a  user 
cannot  react  to  the  service  ho  is  getting  except  by  modifying  his 
pattern  of  requests,  or  by  simply  leaving  the  system.  Moreover,  the 
pricing  of  computer  usage  was  based  on  flat  rates,  which  are,  as 
Shawn  later,  not  dynamic  enough  and  thereby  lead  to  josses  of  effi¬ 
ciency  [K>1. 

Another  factor  which  tends  to  limit  the  size  of  computer  sys¬ 
tems  is  the  existence  of  non-linearities  in  the  overhead.  Tf  the 
total  overhead  grows  faster  than  a  linear  function  of  the  size  of  the 
syoten  or  the  number  of  users,  there  is  a  critical  sire  where  it  gets 
unbearable.  Tor  instance,  if  there  are  n  processors  accessing  m 
memory  banks,  the  complexity  of  the  cross-bar  switch  is  known  to  be 
proportional  to  n  x  n[17J,  and  the  time  to  solve  conflicts  is  propor¬ 
tional  to  log  n  *  loy  n,  both  terms  introducing  non-linearities.  As 
far  an  allocation  is  concerned,  our  linearity  criterion  forbids  us  to 
spend  rwre  tip**  or  computing  power  to  nake  an  individual  allocation 
derision  on  a  larger  system.  This  is  a  very  drastic  condition.  Note 
'that  most  smart  pane  replacement  algorithms  (like  least  Recently  Used 
or  Annina's  voiktnq  set)  do  not  satisfy  the  criterion,  while  simple 
algorithms  MFC)  do.  Note  that  the  swapping  algor  ithm  which 

will  be  presented  in  IV. 2  doer,  satisfy  the  linearity  criterion,  while 
the  one*  of  5V.1  and  of  Chapter  III  do  not. 


CHAPTER  II 


PRICING  AND  RESOURCE  ALLOCATION 

II. 1  Introduction. 

Suppose  that  a  coffee  shop  was  serving  ice-cream  to  people  on  a 
first-come-f irst-served  basis,  without  asking  them  to  nay  for  it.  As 
the  news  passed  through  the  town,  an  enormous  queue  of  children  wait¬ 
ing  to  get  their  ice-cream  was  formed  outside  of  the  shop.  Some  of 
the  children,  after  getting  a  first  ice-cream,  were  going  back  to  the 
end  of  the  queue  and  waiting  for  a  second  one,  and  so  on.  When  the 
shop  started  asking  a  quarter  of  a  dollar  in  exchange  for  an  ice-cream, 
the  queue  vanished. 

It  seems  that  computer  scientists  were  slow  to  find  out  that  a 
computer  system  is  just  a  service.  If  it  is  given  for  free,  there  is 
a  tendency  towards  misuse  and  efficiency  is  lost.  I  expect  this  to  get 
more  obvious  as  the  extraordinary  growth  of  the  computer  industry  slows 
down. 

Allocating  resources  was  defined  as  solving  conflicts  between 
simultaneous  requests  for  the  same  facilities.  But  wouldn't  it  be 
better  to  just  avoid  those  conflicts  by  pricing  the  resources  high  e- 
nough  so  that  the  number  of  them  is  greatly  reduced? 

There  are  two  possible  philosophies  in  relating  resource  allo¬ 
cation  to  an  economic  system;  they  are  given  here  for  the  case  in  which 
there  is  only  one  resource,  but  they  can  be  generalized  to  more  compli¬ 


cated  situations: 
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1)  The  "a  priori"  pricing  philosophy.  The  system  chooses  a 
price  for  the  resour  .  As  soon  as  a  customer  arrives,  ho  gets  the  re¬ 
source,  provided  that  he  wants  to  pay  the  price  and  that  the  resource 
is  still  available.  From  time  to  time,  prices  are  adjusted,  depending 
on  variations  of  the  offer  and  demand  levels.  This  was  advocated,  for 
instance,  by  Nielsen  [16]. 

2)  The  bidding  philosophy.  Each  user  submits  a  bid  for  a  re¬ 
source.  At  a  certain  time  (chosen  by  the  system)  ,  the  resource  is  given 
to  the  highest  bidder.  In  Sutherland's  yen  system  [18],  the  previous 
bids  are  known  by  all  the  users.  In  the  case  of  a  real-time  bidding, 
the  bids  would  be  secret  (essentially  to  avoid  the  overhead  of  letting 
the  user  consult  the  currently  expressed  bids).  This,  however,  does 
not  mean  that  the  user  would  pay  the  full  amount  of  money  that  he  offer¬ 
ed;  in  fact,  I  suggest  in  II. 2. 4  that  the  user  should  only  be  charged 
the  minimum  amount  that  he  would  have  had  to  offer  to  get  the  resource. 

The  bidding  philosophy  has  two  advantages  over  a  priori  pricing 
and  one  drawback: 

1)  The  highest  bidder  always  gets  the  resource  (and  not  the  first 
to  arrive) . 

2)  The  bidding  itself  automatically  determines  the  price  to  be 
charged  to  the  user,  so  that  no  price  adjustment  is  necessary. 

3)  However,  with  the  bidding  method,  the  user  doesn't  know 
whether  he  is  to  got  the  resource  until  the  time  arrives  for  the  auction 
to  be  closed. 

Note  that  both  philosophies  can  be  combined  in  the  following  way. 
The  system  sets  a  price  at  a  certain  level  above  the  average  price  at 
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which  the  resource  is  expected  to  bo  sold.  A  user  hen  the  choico  of 
reserving  tho  resource  immediately  when  he  asks  for  it,  at  the  price 
sot  for  it,  or  taking  tho  chance  of  waiting  until  tho  time  has  arrived 
where,  if  tho  resource  has  not  yet  been  allocated,  the  bids  are  ex¬ 
amined  and  tho  resource  allocated  and  priced  in  the  second  way. 

I I . 2  User  made  decisions  in  a  system  with  decentralized  control . 

I I. 2.1  Some  definitions. 

Facilities.  There  are  several  kinds  of  resources  (or  facilities) 
in  a  computer  system.  For  instance,  a  cortain  number  of  bits  in  a 
storage  device  is  one  resource,  while  accessing  (writing,  reading,  or 
oxecuting)  those  bits  is  another  rer.ourco.  A  piece  of  software  (like  n 
compiler)  can  itself  be  considered  as  a  resource,  which  can  lie  bought 
or  rented  for  money,  but  in  the  following  developments  only  the  hard¬ 
ware  resources  will  be  considered. 

Note  that  anything  demanded  by  the  user  can  be  called  a  resource 
(execution  of  a  programmed  operator,  having  a  certain  program  in  core 
memory,  etc...).  Ultimately,  money  is  itself  a  resource:  thin  notion 
will  be  useful  in  a  later  section. 

User  and  System.  The  user  is  not  just  the  human  being  who  pro¬ 
grams  the  computer:  it  is  an  indoeendent  d«’cinir>n-JMk»n<i  entity,  ton- 
posed  of  tho  human  and  his  programs,  and  even  so-called  "system's 
routines"  that  another  part  of  the  user  has  decided  to  activate. 

The  system  is  a  particular  user  which  nn’ses  resource-allocation 
decisions.  Computer  operators,  managers,  and  basic  system's  proor.ir-t 


belong  to  the*  uyutem. 
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11.2.2  Decmlonn  mado  which  are  /clovant  to  the  user. 

The  sophisticated  user  might  want  to  make  tame  derisions  which 
arc  ordinarily  runic  by  tho  system.  The  first  idea  which  comes  to 
wind  is  that  the  user  dejiror.  lo  optimise  himsol ?  relative  tohl* 
environment. 

Supjxwe  first  that  each  resource  has  a  price,  in  available  in 
any  .mount,  m*i  that  the  user  trios  to  minimise  tret  unit  rest  of 
running  in  such  a  system,  hater,  it  will  l>e  seen  that,  in  general, 
such  a  reasoning  is  too  uimpl tut lc,  because  the  users  defend  is  not 
small  relative  to  the  total  amount  of  resource*  In  the  syne-**,  Eco¬ 
nomists  say  that  the  demand  is  not  atomic. 

vfhat  are  tone  of  the  decisions  which  chi  Id  !«e  made  ly  the  user? 

1)  fiizo  of  his  working  set  of  pages  tn  first-level  srtery  (in 
a  paged  system) . 

2)  Choir,  of  an  urgency  (total  bit)  in  a  bidding  sys'tm,  or 
priority  lev.  1  in  a  sy*t#->  w  *  re  a  cost  is  associ  ated  witu  each  pri¬ 
ority  level). 

ii  Si co  do* trod  f*  r  1/0  suffers. 

4)  Ee  id-  i.ci'  of  a  liven  sc  tmeni  at  a  certain  memory  level,  Hes- 
Idenry  of  a  flic  at  a  certain  O'Mry  level. 

!*)  ColUctlni  and  eventual  giving  to  the  syrne**  of  some  statistic* 
an  a  program  which  i  »  often  run.  The  idea  t  lit  a  user's  kfiwIHie  in 
important  in  t  rd  t  to  gel  l-etter  j.vilm  has  been  advocated,  for  Inata-ce, 
in  |l?.i  1.  S*  rating's  *lt  oni’ration  |1?)  that  his  %«i  rhino  set-  al- 


prfufw  a*  tier  than  I..K.U.  M^**ntly  #e*tl»  csn  I’*' 

reinterpret**!  Wy  Hut  the  inferiority  of  is  kImi  if 

handles  all  uicrs  in  th*  *****  *»y* 

The  u«*-r ,  as  v^i  4,4Vf»  tWlned  it.  could  C4 1 1  <wi»  <Mr^l 

lyfiiM1*!  routine  to  «Jo  f»4*  jeh.  (bw^wf ,  I’tls  »m!Im* 
umJffr  responsibility  of  the  user,  *^v>l  he  u^lil  l«  eh*f»ie*l  for  the  re¬ 
source*  by  this  routine. 

“he  re**t  *m^ciiM  «ti  1 1  she*  an  •**-?  I*  of  *■***  iishoff?  n* 
tv**«n  the  osa«|*»  of  f'wril  resources  cm  *>e  W  *h  *»4  t»»  user 

to  Irf.rove  u»«  rtifritcf  of  hi*  j-rv'eM**.  hi  oi*^t 

ever,  such  trad^-eff*  cajmwi  i*e  trusted  to  lb*  us*t*.  NrCAU**  the 

*.wo  »i rateuy  h»*  lu  k  for  sit  |>r«m*e*»  ift  t  k*  *v*t«**.  HvcH  s 

tl tuation  is  4Mlyf4>J  in  IV.)  teH*fi«*r  IVl, 

I  f .  ? .  1  An  »«yrlr  sVyjiif  tt*lM-«ff»  l*t***n  I  Ho  u*o  of_  **?***). 
ty  A  u»rf  .1  o»»»  *!  » "• »_  r*m**l  jet . 

Pi  fictitious  cfisfller  ho*  !•  wri*  lor  ti'tfiM*!.  each  of  *hich 
h*f  certain  character  1st  lea  a*  fir  a*  li*e  lerilifr  of  the  pr^ra*  tr- 
frrenm  is  wiwim^. 

(I)  Jf/ftt  t*  analyser  aoJ  If  crwraphlcal  analy^r 

?IH  of  iS"  *»■*  »ty  ref*r*>f  *r*,  and  ©c^u**!**  f.1  m  rds 

(?)  frrer 

y.li  of  referenc**,  <f  words 

IJI  StleriMfi^r  twl« 

|*,%  of  rphrsnfes,  If  sottli 

<41  Co4;  *oi  data  sofwnte  currently  I 

4i  of  references,*^  worts 
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<j|  fyni4*  I4hl«i  and  roi it  »***■* 

^5*  of  All  references,  INC  words 
■|!*e*e  *»  tyy*n  of  #v<f««*nt*  have  very  different  urorertlesi 
(It  E4eH  it*#  (1)  Is  ac««s«Hl,  *lrs»pH*t  All  of  it#  word*  ere 
4CCSM#d, 

(2)  h  A{c«««<d  wry  fAldv. 

(>)  Has  4**9iii  A  to  I*'  word*  of  Information  tor  identifier, 

04"ft  of  vf»t«h  is  a<«a«oI  about  once  of  twice  oarh  tine  the  identifier 

1 4  AfC9»M. 

(4)  Is  accessed  quite  randomly!  two  con*ecutlve  accesses  are 
never  conti  otou*  or  very  clone  in  time,  and  not  always  contiguous  in 

»pAM. 

(5)  Accesses  to  IS}  arc  frequent,  but  not  very  correlated  in 

tin*. 

The  masory  system  has  1)  fast  register*  accessed  in  ,1  micro¬ 
second,  and  2)  core  memory  accented  in  1  microsecond.  The  hardware 
allows,  fer  instance,  swapping  panes  of  .12  words  between  the  two 
levels  of  wenorv,  with  a  renlaccmont  algorithm  of  the  "workinn  set" 
type  of  Teter  Penning  (13].  The  slr.c  of  the  available  memory  is 
supposed  to  be  larger  than  what  the  user  ninht  request. 

Assume  that  there  exists  a  rricina  system  for  the  computer  re¬ 
sources,  with  a  prici  Pj»10  for  a  memory  cell  of  tyuc  1,  and  p,=l  for 
a  t"p«  2  nemorv  cell,  per  unit  of  time.  In  thin  simplified  model,  the 
costs  of  the  CM)  and  of  the  swappinq  bus  between  the  two  memories  are 
supnosed  to  be  negligible.  This  system  of  pricer,  is  supposed  t  >  bo 
quite  stable,  and  the  user  can  assume  that  it  will  not  vary  more  than 
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•lightly  over  a  rather  long  period  of  The  user  has  to  determine 

optimum  residency  or  swapping  strategies  for  pagen  of  his  various  seg¬ 
ments.  This  assumes,  of  course,  that  the  hardware  in  able  to  recognize, 
for  each  user's  segment,  which  strategy  should  apply  to  it. 

The  detailed  computations  underlying  thin  example  are  not  shown 
here;  however,  they  indicate,  with  some  restrictive  hypothesis,  that 
the  optimum  strategy  has  to  leave  (1)  resident  in  fart  registers,  (2) 
and  (4)  resident  in  core  memory,  and  (3)  and  (5)  should  have  pages 
swapping  between  tho  two  levels,  with  a  working  set  size  of  about  32 
references.  If  fast  memory  had  been  much  more  expensive  (overloaded 
system),  (2),  (3)  and  (5)  should  have  been  resident  in  core  memory. 

The  point  is  that  it  is  possible  to  linearize  the  average  cost  of  one 
reference  for  a  given  segment  and  a  given  strategy,  in  tho  form: 

C  *  C0  +  al  pi  +  a2  P2 

where  p^  and  p2  are  the  prices  of  the  resources,  and  a^  and  a2  are 
coefficients  which  depend  on  the  chosen  strategy  for  the  segment.  This 
allows  quite  fast  determination  of  the  best  strategies  for  given  costs 
of  resources,  before  running  the  compiler.  But  lot  it  be  stated  again 
that  this  optimization  job  is  relevant  to  tho  user  and  not  to  the  sys¬ 
tem  (the  concept  of  user,  of  course,  includes  programs  working  for  the 
user) . 

Note  that  if,  for  the  best  possible  set  of  strategies,  the  average 
cost  of  a  program  reference  is  too  high,  the  user  might  decide  to  delay 
his  run  (the  threshold  might  be  a  function  of  the  urgency  of  tho  job) . 

In  Figure  II-l,  a  strategy  is  represented  by  a  point  whose  coordinates 
are  the  resource  utilizations  of  the  strategy.  Strategy  3  it;  optimal 
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Fig.  IX-1 


Trade-offs  between  several  possible  strategies 
Strategy  #3  is  optimal  (lowest  cost) 

The  parallel  lines  join  points  of  equal  cost- 
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became  its  co*t,  which  is  a  linear  function  of  the  renourco  u »aqo, 
is  w.ninue.  iiote  that  thin  figure  does  not  pertain  to  the  compiler, 
but  la  just  an  illustration  of  roiw  possible  strategics. 

IX. 2. 4  Pricing. 

Modern  oc  fiosnir-ts  like  Dcbreu  (21)  have  shown  that  a  wrong  pric¬ 
ing  of  resources  leads  to  inefficiencies  and  loss  of  potential  power. 
They  advocate  a  system  of  marginal  prices.  For  instance,  if  the  demand 
for  computer  resources  is  low  during  night,  the  price  should  be  corre¬ 
spondingly  low.  if  thure  is  just  one  user  on  the  system  (no  conflict 
in  demand),  the  price  should  be  just  equal  to  the  marginal  cost  of 
keeping  the  system  running  (the  cost  of  the  operators,  plus  of  elec¬ 
tricity)  . 

I  believe  that  the  following  points  characterize  a  fair  system: 

1.  It  will  always  sell  a  resource  at  a  marginal  cost  (see  point 

#4)  . 

2.  It  does  not  make  any  distinction  on  behalf  of  the  user  (name 
of  the  user  or  previous  history)  .  In  particular,  if  the  user  is  willing 
to  pay,  there  is  no  reason  to  penalize  him  even  if  he  has  used  many  of 
the  facilities  of  the  system  in  the  recent  past.  In  other  words,  there 
is  no  implicit  priority  system. 

3.  The  system  will  never  charge  more  than  the  user  announced  he 
wanted  to  pay.  Nevertheless,  the  user  might  be  given  the  resource  at  a 
price  smaller  than  this  maximum. 

4.  The  general  rule  for  allocating  resources  and  charging  for 
them  is  the  following:  the  system  takes  the  allocation  decision  which 
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maximizes  its  profit,  but  nevertheless  it  charges  tho  individual  user 
tho  sum  run  amount  of  mo.  >iy  that  this  user  would  have  hod  to  offer  to 
get  the  resource)  all  other  bids  being  unchanged. 

5.  An  immediate  consequence  of  rule  4  is  that  two  identical  re¬ 
sources  will  cost  the  sa r.c  at  the  somu  time,  independent  of  the  usors 
they  are  allocated  to. 

6.  If  a  sharablc  resource  is  available  during  a  time-slice, 
then  any  user  may  use  it  without  any  cost  to  him.  This  takes  care  of 
the  reentrant  routines,  for  which  only  one  user  pays  at  a  time  (the 
first  user  to  get  the  routine  in  core).* 

7.  The  system  must  distinguish  between  conflicting  demands  for 
a  resource  (most  of  the  practical  cases) ,  and  cooperating  demands  (for 
instance,  a  reentrant  piece  of  code) .  In  the  latter  case,  the  different 
demands  are  considered  as  only  one,  with  the  maximum  amount  of  money  be¬ 
ing  the  sum  of  what  each  user  wants  independently  to  spend  for  this  re¬ 
source. 

A  general  idea  underlying  this  thesis  is  that  marginal  pricing  will 
have  a  good  effect: 

1)  By  trying  to  get  better  response  at  a  lower  price,  the  users 
will  increase  the  system's  efficiency  rather  than  wp_k  against  it  (the 
problem  of  counter-measures  has  been  reviewed  in  [22]). 

2)  Statistics  will  be  provided  to  aid  the  users  in  estimating 
their  chances  of  getting  the  desired  response  for  a  given  amount  of 
money  at  various  hours  of  the  day. 

XA  more  complex  model  could  be  imagined  under  which  the  cost  of 
the  resource  would  be  shared  by  the  participating  users. 


3)  Statistics  will  show  the  systems  managers  in  which  equipment 
lies  a  bottle-neck  or  which  equipment  is  not  really  needed. 

There  is  another  reason  for  marginal  pricing  in  a  system  whoru 
the  same  resource  is  reallocated  very  often.  Without  it  a  user  at  an 
open  auction  would  automatically  arrive  at  marginal  price  anyway,  by 
slightly  increasing  his  bid  until  he  got  the  resource  (or  the  bid 
reached  the  limit  of  what  ho  wanted  to  pay).  In  any  case,  the  over¬ 
head  implied  by  such  a  strategy  may  be  avoided  by  assuring  the  user  of 
a  "fair"  price  even  if  he  immediately  submits  his  maximum  bid. 

II. 3  Indivisibility  in  space  of  the  user  requirements. 

The  previous  paragraph  handled  cases  where  the  demand  could  be 
considered  as  being 

1)  atomic 

2)  for  a  resource  which  could  be  allocated  independently  of  any 
others,  and  independently  of  any  previous  or  future  allocation  of  the 
same  resource. 

Alas!  This  is  not  true,  in  general.  It  is  impossible  to  allo¬ 
cate  just  IK  of  core  to  a  program  asking  for  3K;  better  not  to  allocate 
any  resource  at  all  to  that  program. 

In  this  section,  is  considered  the  indivisibility  in  space,  where 
the  space  considerid  is  the  space  of  the  resources.  A  given  user  asks 
for  a  set  of  resou;ces,  for  instance,  for  the  duration  of  a  time-slice. 
Now,  consider  how  the  system  reacts  under  both  the  pricing  and  the 
bidding  philosophies. 

1)  Under  the  pricing  philosophy,  a  user  is  allocated  as  soon  as 
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he  a&Ka  for  his  sot  of  resources,  if  there  are  enough  available  re¬ 
sources  to  satisfy  him. 

2)  Undor  the  bidding  strategy,  all  users  are  allocated  at  the 
same  moment,  and  the  system  tries  to  allocate  a  set  of  users  in  a  way 
such  as  to  maximize  some  economic  criterion.  If  one  tries  to  achieve 
a  balance  policy  (i.e.,  to  allocate  an  equilibrated  set  of  users,  to 
better  use  all  the  resources) ,  then  the  bidding  philosophy  has  to  be 
adopted. 

Denning  (12]  has  proposed  to  formulate  the  allocation  problem  as 
a  0/1  integer  linear  programming  problem  (also  known  as  a  multidimen¬ 


sional  Knapsack  problem) . 


If  user  #i  asks  for  an  amount  a. .  of  re- 
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source  #j,  then  the  system  has  to  find  a  set  S  of  users  such  that: 


i<S  aij  1  Aj  ™ 

where  is  the  available  amount  of  resource  j.  The  economic  criterion 
(cost  function)  has  the  form: 


E  =  £  c 

,  _  i  ,  where  c.  is  the  bid  of  user  #i. 

i£5  i 


The  resources  are,  for  instance,  core  memory  and  CPU.  Suppose  that 
one  user  asks  for  25%  of  the  CPU  and  50%  of  core  during  a  certain  time 
interval,  and  another  user  for  70%  of  the  CPU  and  40%  of  the  core. 


Clearly,  if  both  of  them  are  allocated,  they  will  not  take  more  than 
90%  of  core  and  90%  of  CPU  (see  figure  2) ,  and  thus  they  can  be  allo¬ 
cated. 


This  solution  to  the  space  indivisibility  problem  is  not  entirely 
satisfying,  because  a  compute-bound  user  might  request  100%  of  the  CPU, 
and  so  should  be  alone  in  the  system.  However,  suppose  that  this  user 
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Fig.  II-2 

Multidimensional  Knapsack  Allocation 


CPU 
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had  only  requested  50%  of  the  CPU  time.  Another  user  could  then  have 
been  allocated  (for  instance  an  I/O  bound  user  who  requests  the  CPU 
quite  saldom) .  This  new  solution  would  be  much  more  optimal.  In 
other  words,  to  U3e  the  terminology  introduced  in  chapter  II,  the 
progress  rates  should  not  bo  determined  by  the  users,  but  by  the  sys¬ 
tem. 

Note  that,  even  if  the  constraints  are  satisfied,  the  allocated 
users  are  not  guaranteed  the  service  that  they  requested.  For  instance, 
suppose  there  are  2  users,  each  of  which  asks  for  45%  of  the  CPU  and 
45%  of  the  disk  channel.  Together,  they  ask  for  only  90%  of  both  re¬ 
sources,  but  there  can  not  be  an  a  priori  expectation  that  one  job  will 
use  the  CPU  while  the  other  is  using  the  channel.  If  there  is  really 
bad  synchronization  between  the  two  jobs,  they  will  often  both  ask  for 
the  CPU  at  the  same  time  or  both  for  the  channel,  so  that  there  will  be 
little  overlap  between  them.  Chapter  III  of  this  thesis  studies  a  model 
of  such  situations,  and  extends  Denning's  multidimensional  Knapsack 
formulation  to  take  care  of  them.  Also  in  chapter  III  is  given  an 
algorithm  to  get  an  approximate  solution  of  the  multidimensional  Knapsack 
under  the  special  circumstances  involved.  This  method  of  solving  the 
Knapsack  has  the  interesting  peculiarity  of  leadinq  to  fair  prices  for 
the  resources  and  the  sets  of  resources  allocated  to  the  users.  In  this 
way,  the  system  can  keep  statistics  of  these  prices  and  use  them  as 
stated  above.  For  thorough  treatment  of  the  Knapsack  problem,  see  [23] 
and  [24]. 

The  "a  priori"  pricing  of  resources  is  also  possible  in  a  climate 
of  space  indivisibility.  The  idea  is  the  following: 


If  there  is  a  cer- 
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tain  set  S  of  resources  available,  the  system  might  expect  to  sell  it 
at  the  average  price  P(S).  If  a  user  asks  for  a  set  S'  of  resources, 
the  system  would  sell  it  to  him  at  a  price 
c  =  k  |p(S)  -  P  <S  -  S'  )j 

where  S  -  S'  is  the  set  of  resources  remaining  after  resources  of  set 
S'  have  been  allocated,  and  K  is  a  constant  greater  than  or  equal  to  1. 

The  precise  function,  P(S),  has  to  be  determined  experimentally, 
by  adjusting  it  to  observed  profits.  Suppose,  for  instance,  that  there 
are  two  resources;  the  CPU  and  the  core.  P(x,y)  is  the  average  profit 
the  system  will  make  out  of  a  percentage  x  of  the  CPU  and  y  of  core. 
Clearly,  P(x,0)  =  P{0,y)  =  0,  because  it  is  not  possible  to  sell  CPU 
without  core  or  conversely  core  without  CPU.  An  example  of  such  a 
function  is  the  cone  represented  in  figure  II-3.  Its  equation  makes 
it  homogeneous  (first  degree)  in  x  and  y: 

p<x,y)  -  VT7  X 

The  coefficients  a  and  b  have  to  be  adjusted  b,  the  system  from 
its  own  experience. 

I I . 4  Indivisibilities  in  time  domain  and  reservations. 

As  an  example  of  time  domain  indivisibilities,  suppose  the  fol¬ 
lowing.  A  program  is  in  core  and  uses  the  CPU.  The  CPU  resource  can 
be  instantly  taken  from  this  user,  but  not  the  core  resource  lest  the 
user's  job  be  destroyed!  The  user  must  be  left  in  core  at  least  for 
the  period  of  time  required  to  swap  him  out  onto  secondary  storage. 

The  problem  of  reservations  is  somewhat  similar.  The  user  who 
comes  to  a  console  wants  to  be  sure  that  he  will  own  the  console  for 
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At  least  a  one  hour  period.  He  :.t*o  wants  to  he  sure  that  he  will  tjt> l 
75K  words  on  tho  disk  for  h  ,  files*  and  at  least  H  of  the  usage  of 
a  compound  resource  (CK'  •  10K  core) . 

It  will  now  he  shown  how  this  example  can  he  handled  by  creating 
a  structure  of  successive  levels  of  allocation.  The  following  ideas 
have  to  be  applied: 

1)  Partial  allocation  decisions  should  be  rade  for  several  tl?w* 
intervals,  some  of  which  are  within  some  others,  huildtnq  a  hierarchy 
in  time.  For  instance,  in  our  example,  the  decision  of  allocatin'}  the 
console  and  some  part  of  the  disk  is  made  for  a  one  hour  interval.  Hew- 
ever,  the  decision  of  allocating  the  CPU  and  eoc^  core  to  a  running 
program  is  rude  only  for  a  one  second  interval. ‘ 


2)  Resources  have  to  ho  pooled  in  order  to  allow  the  user  to 
buy  a  percentage  of  the  pool  in  advance  (1st  level  allocation),  wlthou’ 
knowing  at  that  tine  exactly  when  ho  will  use  his  buying  rower  (2nd 
level  decision) . 

2 

I  am  grateful  to  Professor  Herbert  Simon,  of  Carnegie-hel Ion 
University,  for  having  convinced  me,  in  a  orivnte  discussion,  of  the 
importance  of  multi-level  scheduling. 
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The  user  miys,  At  the  first  level,  a  potent ia!  t*»wcr  to  lw*  used 
at  the  second  level.  Tills  potential  pcwr  can  Itself  Ito  called  a  woney 
(or  a  resource),  which  I#  only  valid  for  bidding  or  buyinu  At  the  sec¬ 
ond  level  «nd  during  the  tiee  durAtion  for  which  the  first  level  deci¬ 
sion  was  nad«r.  hecAuse  this  resource  csists  in  «  United  amount  only, 
the  user  is  guaranteed,  At  the  monent  he  buys  it,  i)ia*  he  lu*  a  certain 
percentage  of  it,  end  thus  a  guaranteed  level  of  service. 

Sole  thAt  tlie  money  dilch  was  used  to  buy  At  the  first  level 

CAnnot  be  u*en  to  buy  At  the  second  level.  One  he*  first  to  buy  the 

intermediate  t/pe  of  noney. 

The  structure  of  AllocAtlen  crested  shove  is  a  liierArchy  of  re¬ 
sources,  which  t*hc*  the  fore  of  a  tree  (fiqv.ro  II-4).  lUieh  resource 
csn  be  used  exclusively  to  buy  resources  which  Ale  under  it  in  the  tree, 
end  only  during  the  tics*  lnterv.il  for  which  the  Allocation  w,ts  Mil**  ai 
the  level  Above.  With  each  node  of  the  tre*  is  AMOCl.ited  a  set  of 

rules,  which  tell  how  it  cen  buy  or  bid  for  the  resources  which  Ate 

under  it  in  %he  tree.  Mote  th.it  scene  part  of  a  resource  wight  he  .it 
sor«  node  of  the  tree,  while  none  other  part  night  be  At  so ee  other  node. 
Tor  ir.stAnce,  a  pert  of  core  memory  nioht  be  available  to  run  ti*e- 
shared  users  voder  a  cert.iin  hind  of  centrACt,  while  Another  part  of 
coro  night  belong  to  a  separate  real -use  user. 

11 . S  More  involved  onnt facts. 

Pcal-ti**e  users  night  want  to  get  a  certain  .mount  of  service  Ire- 
fore  a  particular  deadline.  This  can  be  handled  either  with  a  blddino 
system,  where  the  user  grrtdually  increases  his  h  c?  *  r  systers  rc*ource 
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whon  tho  deadline  approaches  (25),  or  with  a  special  contract,  whore 
the  syotem  takes  the  responsibility  of  finding  out  whether  a  given 
user  can  bo  satisfied  within  tho  current  structure.  Note  that  tho 
scheduling  method  suggested  in  chapter  III  can  be  extended  to  take 
care  of  such  users. 


CHAPTER  III 


AN  ANALYTICAL  MODEL  OF  SPACE  SHARING 

III.l  Introduction . 

This  chapter  attempts  to  show  that  better  schedulers  could  be 

3 

designed  if  some  model  of  the  interference  between  users'  requests 

4 

for  facilities  were  available.  A  program  is  a  string  of  references 

to  certain  resources.  Whenever  a  program  references  a  resource  which 

is  already  totally  allocated,  a  conflict  occurs  and  the  scheduler  has 

to  decide  which  program  will  get  the  resource.  It  will  be  shown  how 

this  can  be  done  in  a  "pseudo-optimal"  way  by  taking  into  account  some 

information  which  can  be  made  available  before  run-time  on  the  patterns 

.  .  5 

of  the  users'  requests  for  facilities. 

Sections  III.l  and  III. 2  introduce  some  of  the  concepts  and  ter¬ 
minology  used  later  in  the  chapter.  Section  III. 3  studies  a  model  of 
"worst  possible"  synchronization  between  the  users'  requests  for  facili¬ 
ties  under  certain  assumptions;  among  which  is  the  assumption  that  re¬ 
sources  are  preemptible  and  each  user  has  a  fixed  priority  for  accessing 
a  resource  (a  given  user  has  a  different  priority  for  each  resource) . 

In  section  III. 4  the  way  in  which  the  previous  model  can  be  used  for 
scheduling  is  examined  by  formulating  the  resource  allocation  problem 

^level  of  conflicts  over  some  period  of  time. 

4 

demand  string 

\his  information,  as  we  shall  see,  is  related  to  the  predicted 
usage  ratios  of  the  various  resources  by  each  individual  program,  in 
the  system. 
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as  a  Mathematical  Programming  Problem  (finding  the  maximum  o L  an  eco¬ 
nomical  function  with  certain  constraints  on  the  variables) .  Then  the 
design  of  an  algorithm  which  yields  a  nearly  optimal  solution  to  the 
M.  P.  problem  is  given.  Section  ill. 5  compares  the  model  of  section 
III. 3  with  models  of  some  other  scheduling  strategies.  The  possible 
fruitfulness  and  extensions  of  this  study  are  discussed  in  III. 6  and 
III. 7. 

There  are  some  resources,  like  the  CPU,  which  can  be  preempted 
(transferred  from  one  user  to  another)  with  less  overhead  than  some 
other  resources  (like  core  memory,  if  the  previous  user  has  to  be 
swapped).  This  leads  to  the  idea  of  having  a  hioraichy  in  time  of 
partial  scheduling  decisions;  some  decisions  being  made  for  smaller 
time  intervals  than  others  (see  sections  II. 4  and  III. 4.1).  The  model 
which  is  studied  in  section  III. 3  will  be  used  in  section  III. 4  to  re¬ 
late  two  levels  of  scheduling.  Whenever  a  decision  is  made  for  a  long 
time  interval  (macroscheduling)  ,  the  scheduler  takes  into  account  some 
information  on  the  future  demand  pattern  of  the  user  during  this  time 
interval.  The  macroschoduler  then  sets  some  parameters  of  the  lower 
level  scheduler  (microscheduler) .6 

Of  course,  solutions  to  macroscheduling  problems  depend  on  some 
information  being  available  on  the  patterns  ni  the  users'  requests  for 
facilities.  Haberman  [2]  has  shown  that  sue!)  information  can  be  useful 
in  avoiding  deadly  embrace  of  processes  in  a  time-shared  environment. 
This  information  might  bo  provided  either  by  the  user  himself,  or 

^Note  that  microschoduling  can  be  done  by  hardware,  which,  for 
instance,  resolves  conflicting  requests  for  a  memory  bank.  Then  the 
software  "sets"  the  hardware. 
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extrapolated  from  statistics  collected  by  the  system. 

What  information  would  be  useful?  It  should  be  relevant  to  the 

scheduler  by  permitting  computation,  for  instance,  of  the  maximum 

possible  "interference"  between  different  jobs.  It  should  be  simple 

and  condensed,  because  the  scheduler  has  to  operate  rapidly,  and  finally 

this  information  should  be  easily  available  and  characteristic  enough 

of  a  given  program  that  it  could  be  used  without  any  modification  for 

several  runs  of  the  same  program  with  different  innut  data.  In  this 

paper,  the  information  used  will  be  the  proportion  of  usacta  of  the 

7 

various  preemptible  resources  over  a  rather  long  time  interval  during 
which  a  job  is  to  run,  and  the  total  (maximum)  amount  of  non-procmpt ible 
resources  required  by  the  job. 

Do  we  really  need  macroscheduling  separated  from  nicroschedulinq? 
In  a  recent  paper  [29],  Stevens  examines  what  was  wrong  with  the 
Chippewa  Operating  System;  he  concludes  that  there  were  two  flaws. 

First,  the  absence  of  a  macroscheduler:  the  Chippewa  system  allocated 
resources  for  an  indefinite  period  of  time,  without  taking  into  account 
the  global  demand  of  each  job.  Thus,  there  was  no  guarantee  when  a  job 
was  allocated,  that  the  job  would  not  ask  later  for  more  memory  than 
was  available,  and  in  this  case  the  Chippewa  scheduler  did  not  take 
back  the  resources  (CPU,...)  already  allocated.  The  second  problem  of 
Chippewa  was  that  I/O  bound  jobs,  or  compute  bound  jobs,  were  not  re¬ 
cognized  as  such  by  the  scheduler,  and  so  this  information  was  not  taken 
into  account  in  assigning  priorities  ror  the  resources.  Wo  will  see 

7 

Preemptible  resources  arc  those  allocated  by  the  microscheduler. 
For  the  moment,  the  reader  might  imagine  the  CPU  as  opposed  to  memory. 


that  better  simultaneity  in  resource  usage  and  between  jobs'  progresses 
is  achieved  by  assigning  a  high  priority  for  a  resource  to  a  job  which 
will  make  little,  use  of  this  resource. 

I I I . 2  Some  Definitions. 

User:  An  entity  which  requests  and  seizes  resources,  and  which 
might  also  give  some  information  about  its  future  resource  requirements. 
In  this  study,  the  words  user,  job,  program  and  process  describe  the 
same  concept. 

Demand  string  of  a  user:  A  program  will  be  considered  as  a  se¬ 
quence  of  calls  to  various  preemptible  resources:  CPU,  I/O,  ...  such 
that  one  and  only  one  resource  is  called  at  a  time  by  a  given  program 
(no  double  buffering,  for  instance).  This  limitation  could  be  removed, 
but  helps  to  simplify  the  presentation. 

Let  R  be  the  set  of  resources.  A  program  is  then  some  strinci 

r,r»...r,  over  R,  whore  r.  means  that  the  user  called  on  resource  r.  as 
1  2  k  1  i 

the  i^  resource  call.  This  notion  is  similar  to  what  Denning  used  in 
more  restrictive  frameworks  to  describe  page  reference  strings. 

Virtual  time  of  a  user:  During  a  certain  real  time  interval  AT  , 
a  user  will  get  the  resources  requested  in  his  demand  string  durino  a 
total  interval  of  time  A.T^ .  Vic  define  AT^  as  the  virtual  time  interval 
corresponding  to  the  real  time  interval  AT  .  Virtual  time  of  a  user 
normally  runs  slower  than  real  time,  but  if  a  user  were  *  ■>  permanently 
have  top  priority  for  accessing  all  resources,  then  virtual  time  for 
that  user  would  be  equivalent  to  ual  time. 

The  virtual  time  diagram  of  a  user  is  a  diaaram  in  which  resource 
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usage  (demand  string)  is  plotted  as  a  function  of  the  virtual  time 
of  that  user. 


C.P.U 

Disk 

Drum 


virtual  tine 


Remember  that  the  assumption  was  made  that  a  program  i.s  a  purely 
sequential  process. 

Observables  of  the  system:  Any  quantities  which  are  relevant  to 
our  study,  among  which  might  be  included: 

1)  The  effective  progress  rate  w^  of  job  If .i  (working  rate).  It 
is  the  portion  of  time  user  #i  was  working,  divided  bv  the  total  real 

time  interval  over  which  the  measurement  was  made. 

_  total  virtual  time  interval _ for  fi.  _ 

i  total  corresponding  real  time  interval 

2)  The  duty  factor  u^  of  resource  #;j  (or  its  proportion  of  usage): 

_  time  resource  #j  is  used  bv  and  iob 
j  total  real  time  interval 

"b  and  u_.  are  both  dimensionless  variables,  which  are  observed 
over  a  certain  interval  of  real  time. 

3)  The  cost  function  (or  economic  criterion)  o*‘  the  system  is 
another  observable.  It  is  assumed  to  bo  a  weighted  sum  of  the  progress 
rates : 

E  =  I  c .  w . 
i  i  i 

where  c^  characterizes  the  urnency  of  user  *fi.  The  precise  mean¬ 
ing  of  c  ^  as  a  bid  will  be  discussed  in  section  I II.  4. 5. 


III. 3  A  inodr  L  based  on  fixed  priorities  (with  preemption)  for  each 


user  and  resource. 

I I I . 3 . 1  Overview  of  the  model . 

For  each  resource,  there  is  a  priority  assigned  to  each  user. 

For  a  given  resource,  these  priorities  a^o  all  different,  (the  users 
being  totally  ordered  with  respect  to  cadi  resource) .  This  priority 
assignment  will  not  be  changed  during  a  certain  time  interval  [0,T] 
over  which  the  conflicts  between  the  demand  strings  of  the  various 
users  are  studied.  If  user  #i  requests  a  resource,  he  will  get  it 
either  if  the  resource  is  currentlv  idle,  or  if  it  is  allocated  to  a 
user  having  lower  priority  for  the  particular  resource  (in  which  case 
the  lower  priorit”  user  will  have  to  wait  for  further  use  of  this  re¬ 
source)  . 

Note  that  a  user  does  not  necessarily  have  the  same  priori  tv  for 
all  resources. 

Let  be  the  proportion  of  the  virtual  time  of  user  i  soont  on 

resource  j  during  the  real  -time  interval  [ 0  , T ]  .  The  a_.  '  s  characterize 

the  needs  of  the  various  users  for  the  various  resources  (for  instance, 

the  degree  to  which  they  are  compute-bound  or  I/O-bound) .  Given  th^ 

virtual  time  diaaram  r(t)  of  user  i,  it  is  trivial  to  compute  his  a.  ,'s 

il 


/ 


1  (  r 


(t)=j  )  dt 


te(t  .  ,t  ] 

min  max 


il 


t  -  t  , 
max  mm 


where 


l(x) 


1  if  n  =  true 
0  if  n  =  false 
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It  will  be  assumed  in  the  sequel  that  a  fairlv  accurate  knowledae 

of  the  a,  ,'s  is  available  (possihly  based  on  east  exnerience  of  the 
il 

programs) ,  but  that  the  precise  demand  strings  and  virtual  time  dia¬ 
grams  of  each  program  over  the  real-time  interval  [0,T]  are  not  known. 

The  situation  may  be  character ized  by  two  matrices  n  x  m,  where 
n  is  the  number  of  users  and  m  the  number  of  resources : 

a^_.  =  proportion  of  the  virtual  tine  of  user  .i  snent  on  resource  j 

p.  .  -  integer  number  renresentina  the  priority  of  user  i.  for  ro¬ 
il  -  ... 

source  j . 

1  <_  i  <  n 
1  <_  j  <_  rn 

0  <  a. .  1 

-  ID  ~ 

Z  a.  ,  =  1  /normalization  of  the  a.  .  for  each  user) 

D  ID  il 

p. .  =  n  i  =  k 

ID  J'O 

p.  .  <  p,  .  <*>  user  i.  has  a  hiqhor  priority  than  k  for  resource  j 

i  i  ^  k  j  — - 

(lower  numbers  higher  priorities)  . 

The  assumptions  are  given  over  a  real-tim^  interval  [0,T],  which 

separates  the  two  activations  of  the  macroscheduling  alnorithm.  If  w 

is  the  progress  rate  of  user  i,  this  user  will  effectively  get  resource 

j  allocated  during  a  time 


T  w .  a  .  . 
i  i] 


(bv  definition  of  a. .  and  w. ) 

il  > 


Resource  j  will  bo  running  durincr  a  total  amount  of  time 

T  u  ,  =  Z  T  w .  a .  . 

1  i  i  1 1 


thus : 


u  .  =  l  a  .  .  w . 
1  i  i  D  i 


(0  <  u  <  1) 
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and,  as  a  consecruence  of  £  a.  .  =  1: 

j  13 

E  w.  =  Z  u, 
i  i  j  3 

The  overhead  of  switching  a  resource  from  one  user  to  another  has  been 
and  will  be-  systematically  neglected  in  the  study  of  the  mode] . 

I I I . 3 . 2  fundamental  equations  and  consequences. 

It  will  now  be  of  interest,  to  derive  the  equations  describing  the 
worst  possible  cases,  where  the  requests,  are  synchronised  in  an  order 
such  as  to  get  the  smallest  possible  progress  rates  and  the  least  oos- 
sible  simultaneous  use  of  the  resources  available.  This  is  relevant 
to  the  general  philosophy  that  the  system  should  alwnvr-  expect,  the 
highest  amount  of  conflict  within  certain  comnuted  bounds.  It  should 
not  oversell  itself  to  the  users,  guaranteeing  them  a  service  that  it 
would  eventual.lv  not  bo  able  to  nivc.  Even  it  the  system  would  decide 
to  take  some  chances  for  a  greater  expected  nrofit,  probabilistic  models 
would  be  dangerous  because  they  assume  a  randomness  and  absence  of 
correlation  between  users  which  are  not  general Iv  true.  Also,  for  a 
given  user,  the  requests  do  not  have  a  random  lennth  under  some  distri¬ 
bution,  and  are  not  uncorrolated  with  each  other.  f  course,  the  com¬ 
puter  could  commute  Marhovchain  coefficients  for  the  demand  strings  of 
the  various  users  and  use  this  information  to  get:  a  bettor  schedule,  but 
this  seems  to  exceed  the  allowable  overhead  of  an  allocator. 

The  following  fundamental  equations  express  that,  in  the  "worst 
possible  case",  a  process  would  bn  wait  ,ng  for  a  resource  at  any  time 
when  this  resource  is  used  by  another  Process  or  h tuber  priority.  Mote 
that  1-w,  is  the  rate  of  waiting  of  user  i. 
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(III-l) 


kj 


w. 


(O^w.  £  1) 
(1  ^  i  ^  n) 


For  given  matrices  (a, .)  and  (p, .),  it  is  alwavs  possible  to  find 

iO  l]  ' 

virtual  time  diaarams  of  the  users,  which  have  the  usage  ratios  a.  .  for 

'll 

the  resources,  and  such  that  each  job  might  hive  the  maximum  waiting 

rate  given  bv  eouations  III-l.  In  other  words,  ir  the  coefficients  a.  . 

‘  il 

are  known  for  each  job,  but  not  the  exact  virtual  time  diagrams  of  the 
jobs,  it  can  be  said  a  nriori  that  the  jobs  will  have  progress  rates  at 
least  equal  to  the  w, 's,  if  and  only  if  the  foil  oaring  equations  are 
satisfied: 


( 1 1 1  —  2 ) 


1  -  w.  \  min  (  /_  a,  .  w.  ,  1  | 

\  TTk  k3  *  ) 


(  V  i,  1  £  i  ^  n) 

(  0<w.  <  1  ) 


Pkj<Pij 


An  equivalent  form  is  given  in  the  followina  equations: 


(III- 3) 


Vi,  e 


either  w.  =  0  or  1>.  w.  + 
- —  1  —  v  1. 


V 

y 

j  »k 


“kj  Wk 


(w  >  0) 


Pkj<  Pij 


Equations  ( 1 1 1 — 2 )  define  a  domain  of  values  for  the  vr's.  Anv 
point  within  this  domain  can  alwavs  be  reached  if  the  system  should 

g 

desire  it.  This  domain  will  he  called  the  attainable  domain,  or 

g 

The  action  to  be  taken  bv  the  system  to  reach  a  nartioular  point 
in  this  solution  space  will  be  described  in  section  ITT. 4.  A  formal 
proof  of  this  statement  is  not  given  here. 
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synonymously  the  domain  of  certainty.  Note  that  eeuntions  ( 1 1 1 — 2 ) 
imply  that: 

( 1 1 1— 4 )  0  <  u.  =  Z  a.  .  w.  <  1  (0  <  w,  <  1)  (1  <  i  <  m) 

-  i  i  n  t  --  -  i  -  -  - 

(The  reader  will  find  this  result  easy  to  orovo) . 

I I I . 3 . 3  Dof ini tion  of  the  mathematical  problem. 

The  previous  model  will  be  used  to  find  a  set  of  users  to  allo¬ 
cate.  Each  user  gives  to  the  system: 

n 

1.  his  usage  ratios  for  the  various  nreenet ible  resources:  a...' 

■  n 

2.  his  uroenov:  c,.  A  hioher  value  of  c.  '"'ans  a  higher  urnonev, 

j  i 

but  also  means  that  the  user  is  willing  to  oav  more  money  in  order  to 

run.  It  is  understood  that  if  a  user  with  urgency  c.  nets  a  prooruss 

rate  w,  durina  a  real  time  interval  of  length  T,  then  this  user  is 
l 

willing  to  pay  at  most 

c  .  w .  T 
l  l 

to  run  during  this  time  interval.  Pricing  strategics  are  studied  in 

III. 4. 5. 

The  mathematical  programming  problem  can  be  stated  in  several 
forms  of  varying  complexity. 

1.  Given  the  a,  ,'s  and  the  c.'s,  find  the  p.  ,'s  and  the  w.'s 
lg  i  il  i 

which  maximize  the  economic  criterion  (or  cost  function): 

(III-5)  E  =  Z  c  w 

while  satisfying  eauations  (IIT-2)  . 

Note  that  the  economic  criterion  chosen  is  equivalent  to  a  cri- 
9 

and,  in  the  second  formulation,  lus  usage  of  the  non-precmotible 
resources :  b ,  ,  . 

n 


4r> 


torion  which  would  tend  to  maximize  resource  usage.  Suppose  it  is 

desired  to  maximize 

!•!  =  ^  d  ,  u . 

.1  3  3 

where  is  the  "woiqht"  or  cost  of  resource  ** i.  L  can  then  be 
rewritten  in  the  form: 


F.  = 


Z 

i 


c . 

i 


w, 

1 


wi  th : 


c .  =  £  a  .  d . 
L  j  *1  3 


2.  In  this  second  formulation,  there  are  two  hinds  of  resources: 
the  macroresourcos ,  (which  are  non-oroomntible^  v»d  i  Unrated  bv 
the  macroschedulcr) ,  and  tim  microreaourccn ,  CL (preempt ible  and  allo¬ 
cated  by  the  microschcduler)  .  h.  will  bo  called  the  a'-uolut  *  amount 

t.l  - 

of  macrorosource  i  desired  by  user  i.  Rv  contrast,  a.  .  in  the  relative 

n  - 

amount  (per  unit  of  virtual  time)  of  microresource  i  needed  bv  uvr  i. 

Now,  the  mathematical  programming  problem  can  be  expressed  in 
the  following  way: 

Find  a  cet  of  users  S,  and  matrices  (n. .)  and  (w. ) ,  which  maximize 

n  i 

the  economic  criterion  (cost  function) : 


( 1 1 1- 5 )  F  =  .  >-  c.  w. 

ier.  j  j 

subject  to  the  followinq  constraints 
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The  non-prenmjitiblo 
cells  at  various  levels  of 


resources  night  be, 
memorv  (core,  drum,. 


for 

.)  . 


i nstnnce , 


momor” 


4' 


I' 

(in-6)  < 


V  j  €<6  Z  b  S  £  H 

ic  s  13  1  J 
whcro  I  *1  *  0  lf  “1  ■  0 

l  Oj  41  1  if  wt  >  0 
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Vi  €  S  , 

s. 


*  VjV  <  1 

pi*  j<pi  j 


B.  is  the  tot.il  available  Amount  of  n  >n-rre<  — »r  iule  resource  *j. 


In  this  section  are  given  a  few  examples  of  .1  Graphical  repre¬ 
sentation  of  the  attainable  do-iin  in  the  space  of  th«  w^’s,  in  order 
to  create  a  feeling  of  how  the  demand  strims  of  the  io‘.>n  will  syn¬ 
chronise.  The  render  will  discover  that  the  "worst  case  domain"  can 
nevertheless  lend  to  a  lot  of  simultaneity  between  the  lobs.  It  war 
verified  bv  simulation  that  the  iohs  do  not  general Iv  synchronise 
significantly  hotter  than  the  worst  case  model  predict*,  i*  there  ir.  a 
small  number  of  preemptible  resources  and  if  the  priorities  are  chosen 
in  a  noarlv  0ntin.1l  way  (see  section  111.4.2). 


Example  1. 


CI'M  PI'-.K 


.ion  1 

JOB  2 


**The  equations  ror  jeft  cxrrnss  that  nnv  user  hnvinn  a  non-?.e*-o 
progress  rate  (and  thus  allocate'!  bv  the  macros  -h'-du lor,  cm  rind  the 
space  he  needs  in  memory  level  #■}. 
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In  thin  arrav,  a  job  in  .1  row  and  a  resource  is  .1  column.  Iv- 
nourco  1  is  the  CPU,  resource  2  in  the  disk;  job  1  i r.  commute  bound, 
job  2  in  more?  I/O  ’.>ound. 

1.  The  maximum  priority  in  oivon  to  iob  1  for  nil  resources: 

(V  "  (2  ’) 

Equations: 


Tho  donain  of  certainty  is  defined  hv:  w  ♦  w^  <_  1 ,  which  shows 
that  no  parallelism  in  the  use  of  tho  resources  is  obtained  in  tho  worst 
case  (fig.  III-l). 

2.  Sunnosc  that  tho  Dr i or i tv  matrix  is: 


"V "(?  •') 


Equations  defining  the  attainable  domain  are: 


1 

> 

w. 

+  .4  w. 

w,  :•  0 

1 

1  - 

1 

> 

.2 

W,  4  Vf 

w  >  0 

1  2 

*)  — 

•« 

-  0 :)(:;)  ‘  0 


The  attainable  domain  corresponds  to  a  noarlv  optimal  usatte  of 
tho  rosourccr. : 


U1  "  wl  *  w2 

u.  ■  .2  w,  ♦  1G  w_ 
2  12 


In 

however, 

u,  snace 

1 


and  u^  are  maximum  at  the  {waint: 

w^  *  . Cj*S  ,  w^  n  .*37  »  .  ft 7 ,  u,  -  .(i~> 

this  case,  both  and  u^  are  maximum  at  the  same  mint.  This, 
is  not  a  ocneral  result,  and  very  comlicated  domains  in  the 
nioht  exist.  Nevertheless,  solvim  tho  equations: 


'/  /  attainable  domain 

ft 

//j  (equations  (III— 2) ) 


(p.  . ) 

11 


2 

1 


Fig .  III-l 

Domains  in  the  (w  ,  w  )  space  for  example  1. 
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Vi,  1  =  w.  +  7  a,  .  w, 
1  jTk  ^  k 

pxj<pij 


might  give  a  good  approximation  of  the  use  of  resources  in  the  attain¬ 
able  domain. 

3.  The  next  case  has  the  priority  matrix: 


Obviously,  the  attainable  domain  is  worse  than  with  the  second 

priority  assignment,  but  better  than  with  the  first  one. 

If  the  objective  function  to  be  maximized  is  E  =  w^  +  w^ 

(c.  =  c„  =  1)  ,  then  the  priority  assignment  ffl  yields  K  =  1, 

1  2  max 

priority  assignment  #2  yields  E  =  1.52  and  orioritv  assignment  #3 

yields  E  =1.15.  The  solution  of  the  mathematical  nroqramm.ina  pro¬ 
max 

blem  defined  in  tho  previous  section  would  be  the  second  oriority 
assignment : 


and  w^  =  , 65 ,  w2  =  .87. 


Example  2 : 

This  example  considers  two  jobs  havino  identical  average  resource 
usaae  characteristics,  but  one  job  seizes  each  resource  during  a  much 
shorter  amount  of  time  than  the  other  job. 


50 


Fig.  Ii'I-2.1  and  III-2.2  show  the  virtual  time  diagrams  of  rich 
job.  Fig.  III-2.3  and  III-2.4  show  how  thov  synchronize  in  real-time 
under  two  different  priority  assignments.  The  exnected  progress  rates 
are  found  to  be  those  given  by  the  "worst  case"  model. 

For  this  example: 

(a.  . ; 


(P 

thus  E 

(p.  .)  =1 


v  •  (:!  :0 
n>  Is !) 


1  ™  W1  +  w2 


=  w,  +  w_  =  1  (w  -  1 
max  12  1 


w2  a  0)* 


H  ■;)$>(:) 

thus  E  =  1.33  (with  w  =  .67,  w  -•  .67). 
max  1 


Example  1  has  a  larger  attainable  domain  and  a  larger  than 


example  2  with  the  priority  assignment  ^  This  is  due  to  the 

fact  that  the  job-  of  example  1  are  complementing  each  other  (one  need: 
more  CPU,  the  other  more  I/O),  while  jobs  of  examnle  2  have  .identical 
average  needs  of  resources. 


III. 3. 5  Multiprocessor  case. 

So  far  only  the  case  where  the  resources  are  not  interchangeable, 
and  are  only  susceptible  to  one  activation  at  a  time  has  been  considered. 
How  the  previous  model  can  be  extended  to  the  case  where  some  resources 
may  have  more  than  one  activation  at  a  time  will  now  be  studied.  For 
instance,  there  might  bo  several  identical  CPU's  or  identical  channels. 

The  fundamental  "worst  case"  equations  arc  quite  comolicated. 

They  are  given  here  without  further  justification. 


virtual  time 
user  1. 


Fig .  ITI-2.1:  Virtual  time  diagram  of  user  1. 


resources 


virtual  time 
user  2. 


Fig.  III-2.2:  Virtual  time  diagram  of  user  2. 


CPU,  user  1 
CPU,  user  2 


resources 


I/O,  user  1 
I/O,  user  2 


Fig.  IIT-2.3:  Real  time  diagram: 


(n.  .)  = 


il 


Fig.  III-2.4:  Real  time  diagram; 


<Pi1>  " 


user  waitinq  for  the  resource 
users  seizing  the  resource 


5.3 


Define  q  ,  by:  r  =  p, ,  <=>  i  =  q  . 

1rj  i]  nr;j 


re  [l,n]  ,  je  [l,m] 


q  .  is  the  number  of  the  user  havinq  the  r-th  nrioritv  for  resource  i. 
rj 

If  resource  j  has  processors  (possible  simultaneous  activations) , 
the  maximum  time  that  user  i  would  spend  waiting  for  resource  j  in  time 
interval  [0,T]  is: 


k  €  [  1 , R_.  ]  \  Rj  k+1  m€[k,r-l]  Snj 


»  ) 


if  r=q. .>  R 
ir 


0  if  q.  .  <  R. 


so  that  the  fundamental  equations  for  the  attainable  domain  are: 


V  i  €  [l,n]  either  1  -  w.  /  7? .  .  or  w.  =  0  (always  w.  'Z  0) 

-  l  .  rT7  ,  li]  i  i 


j  £  [l,m] 


Theorem  1.  A  smaller  domain  than  the  attainable  domain  defined  by  the 

previous  equations  can  be  defined  by  equations  ( I X I— 2 )  where  a^..  has 

been  reolaccd  by  c<. .  =  aij  . 

il  — - 

'i 


Proof:  by  choosing  the  first  of  the  quantities  whose  minimum 


is1 h: 


y . .  4  — 

I  lj  N  R 


R.  .  -  r  i  , ,ai j  Wi 

3  i£[l,r-lj  J 


if  r  =  q. .  >  R. 

13  /  3 


and  77  ,  .  =  0  ^  — r~  ^  a.  ,  w.  if  r=  q.  .  ^  R. 

'i3  v  R.  .  ,“7l  ,  ,  13  1  13  N  3 

J  3  1  C-  l,r-l]  J 


hence  the  theorem. 
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By  replacing  the  a^'s  by  the  oC's  the  multiprocessor  case  has 

been  reduced  to  a  monoprocessor  case.  The  accuracy  of  this  aonroxima- 

tion  can  be  felt  even  more  in  the  special,  but  very  important  case 

where  a.  .  <  a  <=>  p.  .  <  p  .  In  that  case,  if  r  =  q,  ,  >  R.: 

ID  k}  ID  k:  j 


i  €[l,r-l] 


ID 


w. 

1 


and,  for  r=q..  <  R .  ,  T?  .  .  =0,  but  then  the  a.  ,'s  are  small 
ID  ~  D  I  ID  ID 

too,  because  they  yield  high  priorities.  This  leads  to  the  intuitive 

feeling  that  for  this  priority  assignment  and  a  large  number  of  users 

(n  >>  R.),  the  exact  model  and  the  model  approximated  bv  theot. .'s  are 
D  ID 

very  similar  to  each  other. 

There  is  another  way  of  approximating  the  multiprocessor  case  by 
a  monoprocessor  case.  Suppose  resource  j  to  be  a  single  resource  (one 
activation  only),  but  which  works  at  R^  times  its  initial  speed.  The 
fundamental  equations  would  then  be: 


-  w! 


>/£t 

D#k 


Ah  K 


pkj<pij 


where  S',  ,  is  the  normalized  value  of 
kD 


Jl 


R. 

D 


Z 

j' 


V 
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The  progress  rates  w\  apply  in  a  universe  where  the  resources 

work  faster  than  initially.  The  correspondence  between  w'  and  w  is 

1  i 

given  by: 


w !  =  w . 
1  l 


Z. 


a .  , 

_L1 


R. 

3 


Thus,  the  fundamental  equations  for  the  processors  workinq  t 


Rj  times  their  initial  speed  are: 


The  reader  can  see  that  this  is  a  more  optimistic  estimate  than 


was  given  by  theorem  1  and  the  o< 


13 


s. 


Examole  3. 


a.  , 

13 

CPU 

Bus 

job  1 

.8 

.2 

job  2 

.4 

.  6 

job  3 

.2 

.8 

Suppose  that  2  CPU's  are  available.  The  following  priorities 
are  assigned. 

and  r  =  w  +  +  w^  is  to 

be  maximized. 

1.  An  exact  treatment  gives  the  fundamental  equations: 
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1  >_  +  min  (.2  w^  +  .1  w^) 


1  >  .2  +  w2 


1  >  .2  w,  +  .6  w_  +  w_ 
1  2  3 


yielding  the  solution  (with  equalities): 
/ 


W1  = 

.94 

It 

CM 

2 

1 

2  U1 

=  .57 

U2  "  * 

» 

E 

max 

=  2.07 

The  lower  bound  method: 

3ij 

Ri 

CPU 

Bus 

job 

1 

.4 

.2 

job 

2 

.2 

.6 

job 

3 

.1 

.8 

W1  " 

.80 

w2  = 

.80  w3  = 

1 

2  U1 

=  .52 

U2 

=  .93 

E 

=  1. 

96 

W3  = 


.32 


is  the  "best"  point  of 
the  worst  case. 


max 


3.  The  "optimistic"  approximation  with  a  CPU  workino  twice  as 


fast: 


thus : 


w^  =  1.33  w^  =  .63  W3  =  .21 

E  =2.44 
max 


Note 


that  w^  is  greater  than  1,  which  is  not  surprising  with 
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the  assumptions.  If  the  constraint  vr  <  1  is  imposed,  the  optimum 
would  be: 

=  1  w2  =  .97  w3  =  .24 

E  =  2.21 
max 

which  is  a  closer  approximation  of  the  case  with  two  processors. 

1 1 1 . 4  Macroscheduling  and  microscheduling  algorithms  under  the 
previous  model. 

This  section  is  intended  to  show  how  the  model  of  section  III. 3 
can  be  applied  to  multi-level  scheduling.  Section  III. 4.1  shows  how 
the  schedulers  look  from  the  users  point  of  view.  An  attempt  is  i_hen 
made  to  solve  the  mathematical  programming  problem  by  separating  the 
assignment  of  priorities  from  the  computation  of  the  proqress  rates. 

A  heuristic  for  assigning  priorities  is  given  and  justified  in  III. 4. 2. 
Once  priorities  are  chosen,  the  problem  is  reduced  to  a  multi -dimen¬ 
sional  Knapsack  problem,  for  which  a  heuristical  solution  is  proposed 
in  III. 4. 3.  Section  III. 4. 4  summarizes  the  results,  and  III. 4. 5  shows 
how  the  users  could  be  charged  at  "marginal  prices";  the  prices  for 
services  being  determined  in  connection  with  the  aloorithm  of  III. 4. 3. 3. 

III. 4.1  Combining  two  levels  (in  time)  of  scheduling. 

When  designing  an  operating  system,  one  of  the  major  difficulties 
is  to  partition  the  concepts  involved.  This  requires,  in  particular, 

the  separation  of  tasks  which  are  loosely  connected,  and  the  imolcmcn- 

•  1 r> 
tation  of  them  as  separate  processes  of  the  svstem.  “  it  is  assumed 

“  12 

This  can  bo  achieved  with  a  hierarchical  structure  such  as  the 
one  proposed  by  Dijkstra  [26]  ,  with  a  communication  svstem  between 
processes  [27]. 
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he.ro  that  the  following  decisions  and  strategies  are  independent  from 
resource  allocation  (or  scheduling)  ,  except  that  thev  mav  provide  in¬ 
formation  or  requests  for  the  scheduler,  but  they  should  not  be  con¬ 
fused  with  scheduling  activities: 

1.  Page  replacement  algorithms  in  a  computer  wi th  paged  memory. 
Which  page  should  be  extracted  from  memory?  Should  the  size  of  the 
working  set  of  pages  be  changed?  Should  there  be  an'/  preparing? 

These  decisions  can  be  made  bv  the  programmer ,  or  bv  the  system, 
but  in  any  case  they  concern  program  optimization  and  no_t_  system 
optimization  (insofar  as  it  can  be  said  that,  the  system  is  not:  improved 
by  improving  the  users'  programs  running  under  it). 

2.  Docidina  on  the  "externa]'1  priorities  or  jobs.  Some  jobs 
are  more  urgent  than  others.  This  might  be  decided  either  by  the  sys¬ 
tem  or  lay  the  user  (which  is  wilt  inn  to  may  more  to  not  his  job  ex¬ 
ecuted  soon).  External  priority  can  bo  reduced  to  the  oconomi e  cri¬ 
terion  of  the  price  offered  by  the  user  per  unit  computation  of  his 
job,  which  is  then  fed  into  the  scheduler,  and  rfil.l  serve  the  scheduler 
in  order  to  build  its  own  optimization  criterion. 

3.  Statistics  to  bo  used  bv  the  schedule’  or  bv  the  naaina 

algorithm  or  to  compute  external  priorities,  can  theoretically  be 

considered  t;o  be  collected  independently  or  those  decision-making 
13 

processes . 

A  scheduler  will  be  considered  to  be  a  mechanism  using  the 
following  information: 

1  3 

If  the.  user  wants  to  collect  such  statistics,  he  ’'/ill  have 
to  pay  for  the  resources  involved  in  the  spy  in  i  :  toco.?". 


i'  I 


1.  On  each  program:  some  information  about  the  kind  of  service 

wanted  bv  the  program,  either  on  a  lone  term  range,  or  because  of  a 

current  request  for  a  facility.  The  a. ,’s  of  section  III. 3  were  an 

^  3 

example  of  such  long  term  range  information. 

2.  The  economic  "bid"  of  each  job,  characterising  its  "external” 
urgency. 

3.  The  resources  available  to  the  system. 

I  believe  that  some  scheduling  should  be  done  for  intervals  of 
various  duration  of  time.  Tor  instance,  microscheduling,  ns  defined 
in  the  sequel,  might,  be  done  for  periods  in  milliseconds,  macroschoduling 
for  periods  in  hundreds  of  milliseconds ,  while  .scheduling  of  tanos 
should  be  done  for  minutes,  and  some  real  time  users  may  not  want  to 
use  the  system  at  all.  unless  they  are  assured  of  gottina  some  minimum 
guaranteed  resource  usage  during  a  whole  hour. 

A  scheduler  decides  to  allocate  some  resources  to  some  users,  and 
chooses  parameters  to  be  fed  into  the  "lower  level"  scheduler  (which 
handles  smaller  time  intervals) . 

Hicroscheduling  and  Macroschoduling . 

Examples  will  now  be  given  of  how  some  usual  allocation  decisions 
can  be  split  between  two  schedulers  working  at  different  levels  of'  tim° 
intervals.  Those  examples  are  summarized  in  table  1TT-1.  Resources 
allocated  by  the  low-level  scheduler  (microscheduler)  are  said  t.o  b 
preemptible,  and  those  allocated  by  the  mncroschodul-'r  are  said  to  be 
non-preemptible. 

14 

Frocmptibj.il ty  is,  of  course,  a  relative  notion  (and  not  abso¬ 
lute).  It  can  just  bo  stated  that  some:  resource  are  more  nreemrtible 
than  others,  if  the  overhead  to  allocate  then  is  snalLo*". 


Current  Computer 

Systems 

Future  Computer 

Systems 

Macroschedulinq 

Allocation  of 

core  memory 

Allocation  of  core 

memory  and 

*"ast  reqisters 

Time  between 

macro-decisions 

-  1  sec . 

10  mill  isoc . 

Microschedulina 

Conflicts  of  accesses 

to  drum  and  disk 

Allocation  of  CPU 

and  fas.  reoistars 

Allocation  of  CTU 

conflicts  in  swat-nina 

-r tween  central  core 

and  fast,  reoisters 

Time  between 

microdecisions 

-  10  mi  1 1 isoc. 

-  1 00  microseconds 

Table  I1I-1 


Examoles  of  macro  and  micro  schcdul Lnc 


Macrorcheduling  can  refer  to  those  sehodul inn  ooorations  which 
are  related  to  the  allocation  of  central  memory.  The  time  interval 
between  two  macrodocisions  would  be  rather  large  (greater  than  100 
milliseconds  on  most  systems).  Microschoduli ng  would  concern  the  allo¬ 
cation  of  the  arithmetic  and  control  unit-;  and  of  some  fast  busses, 
to  programs  which  are  already  essential lv  present  in  central  memory. 

For  instance,  the  decision  of  what  a  oh  is  allocated  a-cens  t.n  the  drum 
for  pago-in  and  page-out  one rati  'ins  between  drum  and  main  core  memory 
is  a  microscheduling  decision  in 'current  commuter  sv«; toms  where  the 
programs  are  kept  in  core  while  paging  takes  Place.  in  future  commuter 
systems,  this  kind  of  paging  will  most  nrobabl'*  be- replaced  bv  a  paaina 
between  two  fast  levels  o<  memory,  like  on  the  380/85. 

I  prefer  the  words  "microschcduli ng"  and  "racroscheduli nc<"  to 
"microqueuing"  and  "macrc  .juouino"  (5],  because  the  latter  suggest  the 
use  of  FIFO  queues  bv  the  scheduling  algorithm,  which  is  a  practice 
that  this  chapter  precisolv  tries  to  discredit,  goto  that,  in  our  term¬ 
inology,  "scheduling"  and  "allocating"  are  synonvmous. 

The  macroschedul or  receives  the  predicted  probable  usage  ratios, 

,  and  the  urgency,  c^,  for  each  user  fi  who  wants  to  run.  Then  the 

macroschoduler  will  solve  the  Mathematical  Pronrammi no  problem  (defined 

in  III. 3. 3).  having  determined  the  sot  of  users  to  he  allocated  durina 

a  certain  real-time  interval  of  length  T,  and  the  matrices  (p. .)  and 

1 1 

(vb  )  for  the  users  in  this  set,  the  macroschedul or  assures  that  the 
non-prcenptiblc  resources  will  be  allocated  to  those  users,  and  transfer 
the  values  of  (rm^)  and  (w^  )  t.o  the  mi  crosc’iedul  or . 

The  microschoduler  controls  the  access  of  the  u  -ers  to  the  ore- 
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emptible  resources,  by  applyinq  the  priorities  which  wore  determined 

by  tho  macrose'nedulcr.  It  also  stops  a  usor  tfi  if  this  user  runs  more 

than  a  time  w^T.  Finally,  it  prevents  the  users  from  exceeding  their 

_  15 

predicted  resource  usage  ratios. 

Sections  III. 4. 2  and  III. 4. 3  will  he  devoted  to  the  solution  of 
the  M.P.  problem  by  the  macroscheduler. 


I I I . 4 . 2  Assignment  of  priorities. 

It  is  interesting  to  try  to  find  a  nr i or. i  tv  assignment  (n.  J 
before  determining  the  progress  rates  (w^)  which  ontimize  the  cost 
function  E  -  E  c.  w. . 


1 1 1 . 4 . 2 . 1  A  case  where  a  given  assignment  of  pi:  iorities  can 
certainly  be  imnrovod. 

Let  P  be  called  the  attainable  domain  under  priority  assignment 
P 

p.  It  will  now  be  shown  that  the  assignment  to  each  job  of  a  set  of 
the  same  priorities  for  all  resources  is  a  wrong  choice,  which  can  al¬ 
ways  be  improved.  Consider  the  following  theorem: 

Theorem  2:  Let  I  be  a  prioritv  assignment  in  which  <  P^, •  for  all 

i.  If,  for  some  i,  p.  .  .  =  p.  •  +  l,t.hon  there  exists  another  assignment 
j  J  -  i  '  3  r  1 1 

P'  which  is  the  same  as  2  except  that  n  and  o  are  interchanged, 

such  that  D  .  oroperly  contains  P  . 

P  P 

In  other  words,  any  point  in  the  seace  of  observables  which 
satisfies  equations  ( 1 1 1— 2 )  under  assignment  P  ,  will  satisfy  (1 1 1 - 2 ) 
under  priority  assignment  P'. 


15  .  . 

See  section  I II.  4. 4 

algorithm. 


for  a  precise  sketch  of  the  microschedul.ina 
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Corollary:  the  maximum  of  the  cost  function16  under  priority 

assignment  P'  will  be  greater  than  or  equal  to  its  maximum  under  P. 

The  proof  of  the  theorem  is  given  in  appendix  A.  The  theorem 
is  quite  weak,  but  at  least  it  shows  that  a  priority  assignment  like 
the  following  can  certainly  be  imoroved: 

1  1  t 

2  2  2 

3  3  3 

4  4  4 

III . 4. 2 . 2  Two  cases  where  it  is  known  how  to  assign  priorities. 

1.  The  users  are  competing  on  one  resource  nnlv. 

There  might  be  more  preemptible  resources  in  the  system,  but 
for  each  of  the  other  resources  there  is  no  more  than  one  user  who 
might  ask  for  it. 

In  this  case,  the  fundamental  equations  take  the  form: 


Viit’  n]  ,  either  w.  =  0  or  1  -  w.  >,  T  a  .  w 

-  l  —  i  / 


k  j  k 


pp<pii 


where  j  is  the  critical  resource,  and: 


E  =  Z  c .  w . 
i  i  i 


(O^w.  <  1) 


is  to  be  maximized. 


Theorem  3 :  The  optimal  priority  assignment,  P,  is  such  that: 


pij<p*j 


<-> 


a . 


ii  <  _!ni 

ck 


The  proof  is  sketched  in  appendix  B. 


16 


or  economic  criterion. 


2.  There  is  a  finite  number  of  users  and  resources,  but  the 


a. ,'s  are  infinitely  small  (of  the  first  order). 

“ID - 

Under  these  assumptions,  1-vr  is  a  first  order,  infinitely 


small  number.  Thus: 


1  -  w. 
1 


rj 


Z 

j  /k 


kj 


Pkj<Pij 


and: 


n  -  E 


max 


£ 

i ,  j»k 

Pkj  ^  Pi j 


c .  a 
i  kj 


The  question  of  which  priority  system  makes  n-r’mx  smallest 
possible,  still  exists.  The  following  theorem  solves  the  problem: 

Theorem  4;  The  optimal  priority  assignment  P  is  such  that,  for  any 
resource  j  and  any  users  i  and  i', 


Pij^  Pi' 


Proof : 


n 


E 

max 


~  I 

j 


Z 

i  »k 


c . 

l 


a 


kj 


Thus,  the  problem  of  finding  the  optimal  priorities  can,  in  this  case, 


be  solved  for  each  resource  independently  from  the  values  of  the  a  's 

i-i 

and  P^j's  for  the  other  resources.  Theorem  3  can  then  be  apnlied,  and 
gives  the  optimum  priority  assignment  for  each  resource. 


I I I. 4. 2. 3  General  case. 

The  general  problem  of  assigning  priorities  is  generally  quite 
complicated.  Using  the  results  of  sections  III.  4. 2.  I  and  III. 4. 2. 2, 
suggests  the  following  heuristic: 

1.  Assign  priorities  so  that: 


Pij^  Pkj 


a.  .  a,  . 
1.1  _ kj 

c .  c, 

l  k 


2.  Try  to  improve  this  priority  assignment  bv  usintt  theorem  2. 
This  improvement  can  be  achieved  in  a  time  proportional  to  m  x  n. 

This  priority  assignment  is  not  always  optimal,  as  shown  by  the 
following  counter-example: 


Example  4 :  3  resources,  2  jobs. 


(a.  .)  = 
il 


.3  .3  .4 

.31  .6  .09 


It  is  desired  to  optimize  w^  +  w^.  The  proceeding  method  loads  to  the 
following  assignment  of  priorities: 


in.  ■) 

il 


1  1  2 
2  2  1 


The  optimum  is  w^  +  =  1.384. 


However,  with  the  priority 


assignment: 


2  1  2 

(pij}  -[ill 


the  optimum  would  be:  w^  +  w^  =  1.477. 
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This  example  clearly  shows  that  the  "good"  priority  assignment 
is  not  always  optimal.  The  major  advantage  of  this  method  of  assigning 
priorities  is  simplicity. 

II I. 4. 3  Assignment  of  values  for  the  progress  rates  w^. 

Once  the  p^'s  have  been  determined,  it  is  desirable  to  determine 
optimum  values  for  the  progress  rates  in  order  to  maximize  E  while  sat¬ 
isfying  equations  (III-2) . 

1 1 1 . 4 . 3 .  .1  The  0/1  integer  linear  prograinmi  na  problem. 

Now  the  mathematical  programming  problem  which  was  defined  in 
part  2  of  section  III. 3. 3  will  be  considered:  maximize  ( 1 1 1  —  5 )  sub¬ 
ject  to  the  constraints  (III-6)  (except  that  the  o. ,'s  have  already 

i.l 

been  determined) . 

This  mathematical  programming  problem  is  not  a  0/1  integer  linear 
programming  problem,  but  it  is  convenient  to  consider  it  as  such  (de¬ 
termine  the  values  of  the  5 . ' s  ooual  to  0  or  1).  Mote  that  the  a. .'s 

i  il 

and  the  are  all  positive  (they  represent,  the  needs  of  the  users 

for  preemptible  and  non-preemptiblc  resources) . 

The  0/1  integer  linear  programming  problem  has  been  reviewed  in 
[23].  Of  particular  interest  to  us  arc  the  studies  in  [24,28].  The 
idea  is  to  find  a  nearly  optimal  set  of  users  to  be  allocated 
(<fh  =  0  or  1)  by  ordering  the  users  according  to  some  criterion  which 
will  be  called  "decreasing  desirabilities",  and  to  trv  to  allocate 
them  (satisfy  the  constraints) .  starting  with  the  user  havino  the  high¬ 
est  desirability. 


The  jobs  are  supposed  to  be  already  ordered  by  decreasing  ex¬ 
ternal  priorities.  It  is  then  necessary  to  decide  about  the  w  's  for 
the  jobs.  For  instance,  suppose  w^  =  .5  for  the  high  priority  jobs. 

A  w^  too  close  to  1  might  strongly  degrade  <  lie  possible  service  for 
other  jobs,  by  obliging  the  system  to  give  a  hi  ah  internal  priority 
for  all  resources  to  the  job  which  has  a  high  external  priority.  This 
would  lead,  as  has  been  seen,  to  a  poor  utilization  of  the  resources. 

If  a  set  S  of  jobs  is  allocated  in  memory,  it  has  to  satisfy 
equations  (II 1—3 )  and  (III-6)  (with  <? ,  -  l  if  i£.c'»,  6.  =  0  otherwise). 

A  procedure  to  find  the  maximal  set  of  users  fitting  into  avail¬ 
able  resources  would  take  the  following  steps: 


Take  the  highest  priority  user:  put  him  in  set  S'. 


Step  2:  Check  whether  S'  is  an  allowable  set:  first  assign  the 


priorities  p._.  tyies',  according  to  the  rule 


p .  .  _  p,  .  i  1 

<  rkg  <=>  - -  < 

c . 

i 


where  the  c.'s  are  in  the  same  order  as  the  external  priorities. 

i 

Then  check  equations  (III-3,  and  III-S)  for  set  S'.  If  they  all  check, 
go  to  step  3,  else  go  to  step  4. 

Step  3:  S  *-  S';  go  to  step  4. 

Step  4 :  Define  S'  as  including  all  users  of  S,  plus  the  hiahest 
priority  user  not  yet  handled.  If  there  are  no  more  users  to  handle, 
the  algorithm  stops,  else  go  to  step  1. 

Using  the  above  procedure  a  maximum  allowable  set  of  users  has 
been  found,  eacli  of  which  has  a  requested  guaranteed  service.  The 


computations  can  be  done  so  that  the  time  reaui red  bv  the  algorithm 
is:  t=Amn+Bmn  log(n).  The  nxloq(n)  term  expressed  the  time  to 

sort  the  quantities  ai j  (to  determine  the  priorities) . 

c . 

1 

Set  of  users  with  non-guaranteed  service. 

Assume  that  the  non-preemptiblo  resources  are  not  saturated  after 
having  applied  the  previous  macroscheduling  nlaorithm.  Some  other 
users  might  then  bo  allocated  with  priorities  lower  (for  each  pre¬ 
emptible  resource)  than  the  lowest  priority  of  the  users  of  set  S.  The 

guaranteed  service  of  users  of  set  S  will  not  bo  affected  by  these 

additional  ("Marginal")  users.  M  will  be  the  sot  of  narninal  users. 

Priorities  in  sot  M  are  determined  accordina  to  the  same  criteria 
as  in  set  S.  Of  course,  the  resource  usage  will  not  bo  as  qood  as  i f 
the  priorities  had  been  determined  optimally  for  the  entire  set  M  +  S . 

Our  solution  respects  the  external  priorities  of  the  users,  while 

maximizing  the  system's  efficiency. 

Example  5:  There  arc  2  CPU's  but  only  one  bus  (or  channel). 


a.  . 

1 1 

CPU 

I/O 

w.  decided  ir  allocated 

i. 

job  1 

.4 

.6 

.5 

job  2 

.3 

.7 

.5 

job  3 

.5 

.  5 

.5 

job  4 

.0 

.  1 

.5 

job  5 

.8 

.  2 

•  rj 

job  G 

.6 

.4 

.5 

These  6  candidates  are  in  the  order  of  their  external  priorities. 
There  is  no  constraint  duo  to  non-prcenntiblc  resources  in  this  example 
The  rec\  lor  can  verify  that  the  algorithm  (v’ith  c.  *  1,  \j  i)  will 
accept  jobs  1  and  2,  reject  3,  and  accent  4  and  5.  This  is  intuitively 
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a  good  choice  because  1,2,  3  are  I/O  bound  while  the  others  are  compute 
bound.  S  =  |l,2f4,5^jM  =  I  3,6^  ,  and  the  priority  assignments  are: 


<Pij)  = 


2  3 
1  4 

5  6 
4  1 

3  2 

6  5 


Note  that  job  3  would  have  been  accepted  if  w  <_  .4.  This  as¬ 
signment  gives  the  resource  usage  u^,^  =  .0,  ui/Q  =  .6,  which  could  be 
improved  by  solving  equations  (III-2)  (oaualities)  for  the  wVs  with 
the  p  's  that  were  just  computed. 


Comments : 


1)  When  deciding  about  the  desirabilities  of  the  jobs,  onlv  the 
external  priorities  and  not  a  more  precise  quantitative  measure  of 
their  urgencies  were  taken  into  account. 

2)  Clearly,  if  the  w^'s  were  comDuted , instead  of  just  being 
arbitrarily  decided  before  the  algorithm  started, a  more  optimal  solution 
could  have  been  obtained. 


III. 4. 3. 3  A  more  general  algorithm. 

The  following  algorithm  attempts  to  find  a  nearly  optimal  solution 
to  the  problem.  Tt  works  in  two  steps: 

1)  Get  an  approximate  solution  by  optimizing  the  economic 
criterion  ( 1 1 1— 5 )  with  the  following  constraints: 

V  E-r1  *  1 

ifiS  j 

VjtOL  y  a  w  $  1 

i  es  3 


<  1 1 1 — 7 )  < 
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The  constraints  for  the  resources  of  sot  (B  are  identical  in 
equations  (III-6)  and  (III-7).  The  constraints  concerning  the  resources 
of  set  are,  however,  weaker  in  equations  (III-6)  than  in  equations 
(III-7).  The  latter  just  express  the  best  possible  case  (where  no 
unnecessary  interference  between  jobs  would  happen) ,  however,  this 
method  is  used  because  equations  (III-7)  are  easier  to  manipulate  than 
equations  (III-6)  and  a  more  refined  solution  will  be  attained  later. 

This  first  step  is  essentially  intended  to  eliminate  from  further  con¬ 
sideration  the  jobs  which  should  certainly  not  be  scheduled  (for  which 
w^^  =  0)  . 

To  get  a  good  approximate  solution  of  this  mathematical  programming 
problem  (III-7),  it  is  not  necessary  to  use  an  cnumorative  method  of 
search.  A  faster  method  which  gives  a  good  approximate  solution  works 
as  follows: 

,  1 7 

Assign  an  initial  weight  K_,  to  resource  j.  Assign  an  initial 

w^  to  job  i.  Compute  the  desirability  for  each  job: 


d. 

l 


c , 

l 


I 


b.  .  K,  + 

i]  3 


w. 

l 


K. 

1 


Sort  the  jobs  according  to  their  desirabilities.  Starting  with 
the  one  of  hiqhest  desirability,  compute  whether  the  job  can  bn  allo¬ 
cated  or  not,  tha*-  is,  if  couations  (III-7)  can  bn  satisfied  with  S 
consisting  of  the  jobs  which  have  alreadv  been  allocated  and  of  the  job 
which  is  a  candidate  to  be  added.  Whether  the  job  has  been  allocated 
or  not,  try  the  next  one. 


17  ... 

The  initial  weights  when  the  microscheduler  is  activated  miaht 
be  the  final  weights  obtained  at  its  previous  activation. 
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When  all  the  jobs  have  been  examined,  comoute  a  new  weiaht  as¬ 
signment  (the  K_.'s)  and  the  new  w, 's  according  to  the  principle  that  a 
job  having  larger  d^  should  have  a  larger  w^,  and  that  a  resource  for 
which  the  corresponding  equation  (III-7)  had  its  loft  side  much  smaller 
than  1,  should  have  its  weight  decreased. 

1 8 

This  entire  process  can  be  repeated  2  or  3  times. 

2)  Having  determined  the  set  S,  a  bettor  approximation  of  the 

w^'s  can  be  determined  by  solving  equations  (ITI-R) ,  with 

p.  .  <  p. ,  .  <=>  a,  ,/c.  <  a. , ,/c. , . 

l  j  l  j  i  j  i  i  j  l 


(III-8) 


1  "  Wi  *  j  60. 


V  i  € 


If  any  of  the  w. 's  of  the  solution  is  neaative,  this  w.  is  re- 

moved  from  set  S,  and  equations  ( I I I — 8 )  are  solved  again,  As  shown, 

equations  (III-7)  gave  a  set  of  users  to  be  allocated  which  could  bo 

somewhat  too  large.  Eliminating  some  users  from  this  set  in  some  cases, 

yields  a  nearly  optimal  set  to  satisfy  equations  (III-6)  while  maxim- 

1 8 

izing  E  given  by  equation  (III-5). 


I I I . 4 . 4  A  summary  of  the  proposed  scheduli no  method. 

1)  Macroscheduling i  It  has  been  shown  how,  niven  the  a^'s, 

b.  ,'s  and  c. 's,  the  macroscheduler  determines  the  o. ,'s  and  w. 's  and 
13  i  il  i 


This  algorithm  has  been  programmed  and  checked  for  several 
examples,  for  various  numbers  of  users  and  resources.  It  always 
worked  satisfactorily.  Note  that  the  choice  of  the  rule  used  to 
get  a  new  set  of  weights  is  essential  to  obtain  a  fast  convergence. 
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transmits  them  to  the  microscheduler.  It  also  allocates  the  non- 
preemptible  resources  for  a  period  of  time  T. 

2)  Microscheduling:  the  microscheduler  keeps  track  of  the  u sage 
of  preemptible  resources  by  the  allocated  users.  If  user  i  uses  re¬ 
source  j  during  more  than  a  time 

w.a.  .T 
i 

then  job  i  is  punished  in  the  sense  that  its  orioritv  for  this  re¬ 
source  is  changed  to  a  priority  lower  than  any  job  which  had  not  ex¬ 
ceeded  its  quantum  on  the  resource.  This  method  assures  that  a  job 
which  accurately  estimated  its  needs  will  be  served  at  least  as  well 
as  promised. 

This  changing  of  priorities  bv  the  microscheduler  does  not  affect 
the  previsions  of  the  mathematical  model  (which  assumed  that  the  micro¬ 
scheduler  did  not  touch  the  priorities  but  onlv  inforce  them) ,  for 
priorities  are  only  changed  when  a  user  exceeds  his  allowed  quantum  on 
a  resource. 

III. 4. 5  Pricing. 

The  determination  of  prices  is,  to  a  large  extent,  a  consequence 
of  the  scheduling  strategy.  In  the  approach  taken,  a  user  agreed  to 
pay  at  most  a  price  c^w^  to  get  a  progress  rate  w^,  and  if  he  nrooosed 
a  larger  c^  he  got  higher  priority. 

However,  the  system  should  charge  the  various  jobs  being  allo¬ 
cated  more  or  less  uniformly.  It  should  not  iust  charge  the  maximum 
possible  to  each  job,  because  otherwise  the  jobs  would  start  with  very 

low  c  's  and  then  increase  them  slowly  until  thev  were  scheduled,  thus 

i 

leading  to  a  greater  overhead.  The  marginal  theory  oc  pricino  theoret- 
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ically  requires  the  system  to  charge  user  i  exactlv  cVw.  ,  where  c',<. 
is  the  lowest  bid  that  the  user  would  have  had  to  offer  to  get  allo¬ 
cated.  Unfortunately,  this  definition  would  lead  to  very  complicated 
computations.  I  suggest  here  a  few  alternative  methods. 

1)  If  l  is  the  first  job  which  was  skipped  (not  allocated)  when 
the  jobs  were  scanned  in  order  of  decreasing  desirabilities  in  the  first 
step  of  the  macroscheduling  algorithm,  and  if  the  effective  proqrcss 
rate  for  job  i,  charge  job  i: 

n.  =  min  (c .  ,  c. )  x  w.  x  T 

1 1  l  l  l 

2)  If  iobs  having  estimated  their  a.  ,'s  incorrectlv  are  to  be 

•  1 3 

penalized,  and  if  job  i  has  effectively  used  an  amount  r„  of  resource  j, 
he  is  charged: 

r.  . 

p.  »  min  (c.  ,  c.)  x  max  (— — ) 
i  lx  a .  . 

13 

3)  A  unit  cost  for  resource  j  could  also  have  been  comnuted: 


g.  =  K.  d?,  where  l  is  the  first  job  not  allocated 
3  3  l 

and  Kj  the  weight  of  resource  j,  as  computed  by  the  macroschedulinn 
algorithm.  If  job  i  uses  resource  j  during  a  time  r^,  he  could  be 
charged: 


p .  =  min  (c,  w.  ,  r .  .  \i .  ) 
i  i  i  j  ID  3 


It  is  useful  to  have  some  prices  for  resources,  so  that: 

1)  A  new  coming  user  can  by  immediate  inspection  of  the  prices 
determine  whether  he  wants  to  get  on  the  system  or  not. 

2)  On  the  long  range,  the  computer  center  staff  might  determine 
the  needs  to  install  or  remove  facilities  (see  tlielscn  [16]). 

The  variations  of  the  u.'s  in  time  should  probably  be  smoothed 
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for  those  purposes. 

III. 5  Models  of  other  priority  systems. 

I I T . 5 . 1  The  equipriority  case  without  preemption. 

In  this  "no-priority  case",  a  user  seizing  a  resource  will  never 
be  preempted  and  will  not  lose  the  resource  until  he  decides  to  release 
it.  The  situation  may  lead  to  almost  no  parallelism  in  the  computations . 
The  worst  case  equations  are: 

(III-9)  tfi€[l,n]  l>w  ♦  I  ak.  wk 

k£i  3 

j 

In  the  situation  of  example  #3,  this  gives  the  following  proaress 

rates : 

w.^  =  16,  w2  =  .32,  w3  =  .64 
so  that  the  overlap  of  activity  is  small: 

overlap  =  w^  +  w2  +  w^  -1  =  12% 

Theorem  5 :  The  attainable  domain  of  the  no-priority  system  is  pro¬ 
perly  contained  in  the  attainable  domain  of  any  priority  system. 

Proof:  compare  equations  (III-2)  and  (III-9)  .  The  latter  imply 
the  former. 

Therefore  the  no-priority  case  is  uninteresting ,  and  should  be 
avoided  in  any  actual  system  desiqn. 

III. 5. 2  The  equipriority  case  with  preemption. 

This  case  would  also  be  called  the  case  of  "Randomly  turning" 
priorities.  The  model  is  characterized  by  the  following  microscheduling 
method: 


Tne  time  is  divided  into  very  short  intervals,  and  the  priority  of 
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the  users  for  the  various  resources  is  changing  from  one  interval  to 
the  other,  cycling  so  that  each  user  spends  the  same  amount  of  time  in 
each  priority  level.  Typically,  the  time  between  two  priority  changes 
might  be  100  microseconds  and  is  small  compared  to  the  interval  betwee  n 
two  allocation  requests  of  jobs  to  the  microscheduler.  Nevertheless, 
assume  that  this  method  does  not  introduce  any  additional  overhead. 

A  random  number  generator  miaht  be  used  at  the  beginning  of  eacl 
time  interval,  to  generate  the  job  priorities  during  this  interval. 

This  would  insure  that  there  is  no  regular  pattern  of  one  job  spending 
most  of  the  time  at  a  higher  prioritv  than  another,  as  happens  with  a 
circular  permutation. 

The  idea  of  such  a  microscheduling  algorithm  has  the  following 
justifications : 

1)  The  hardware  could  allow  time-sharing  of  a  CPU  or  a  channel 
on  very  short  time-slices,  however,  we  don't  know  whether  this  would  be 
a  good  practice. 

2)  It  is  desirable  to  assure  a  user  of  a  certain  percentage  of 
use  of  some  resources,  under  any  circumstances.  Time-slicino  on  a  very 
short  time  basis  might  seem  a  natural  wav  to  do  it.  If  user  i  is  assured 


of  having  the  top  priority  on  resource  j  durinn  a  portion  of  time  A_.  AT 
where  AT  is  some  small  interval  of  time,  then,  with  the  a^'s  defined 


previously,  his  progress  rate  will  be  at  least 


w , 

l 


nin 

j 


However,  a  much  higher  "lower  bound"  estimate  for  the  w/s  can 
be  computed.  After  having  done  it,  these  new  "worst  case"  equations  will 


be  compared  to  equations  ( 1 1 1 — 2 )  and  it  will  be  shown  that,  under  some 
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assumptions ,  the  "turning  priorities  microschedulinci"  performs  poorer 
than  a  fixed  prior'  algorithm  with  the  p. ,'s  well  chosen.  This  re¬ 
sult  has  been  checked  by  simulation,  and  the  followina  discussion 
attempts  to  establish  a  theoretical  justification. 

Under  this  new  model,  if  k  users  compete  for  some  resource,  each 
one  will  get  it  during  a  portion  of  the  time  1/k.  Consider  resource  j 
User  i  will  seize  it  during  a  period  T  a^w..  In  the  worst  possible 
case,  the  maximum  overlap  of  requests  occurs  on  resource  j.  Thus,  the 
time  spent  by  user  i  waiting  for  resource  j  is  less  than  or  equal  to 

min  (a  .w,  T,  a.  .  w.  T) 
k  ^  i  k]  k  i]  i  ' 


This  points  out  that  if  a  job  k  asks  for  less  time  on  resource  j 
k"‘an  3°b  the  maximum  time  spent  by  job  i  waiting  for  resource  j  be¬ 
cause  of  job  k  will  be  T  aw.  If,  on  the  other  hand.  a.  ,w.T<a  w  T, 

*  i]  1  kj  k  1 

job  i  will  wait  for  resource  j  because  of  job  k  at  most  durina  a  time 

aijWiT*  *see  fig*  TII_3)* 

The  worst  case  equations  are  thus: 


(III-10) 


+  £  min  (  a  w 
k#i  K3  K 

j 


"ij  Wi  > 


Vi6[l,n] 

(w^  0) 


These  equations  define  the  attainable  domain  with  turning  priorities. 

Theorem  6:  For  every  point  in  the  attainable  domain  defined  by  eauations 
(III-10) ,  there  exists  a  priority  system  in  which  this  point  is  attainable 


according  to  equations  (III-2) . 


Proof:  Define  this  priority  system  by: 


o.  . 

il 


< 


M 


W, 

K 


<  =  > 
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Assume  that,  for  a  given  j,  the  a^'s  are  all  different.  Then 
obviously,  equations  (III-10)  imply  equations  (TIT-2)  for  this  system, 
which  are: 


Vi  €U,n]  1  >  w;  +  £  a  w 

1  k#i  * 

a,  .w,  <  a,  ,w, 

*D  k^  id  i 

This  theorem  is  reassuring  because  it  says  that  whatever  a  user 
is  assured  of  doing  under  a  turning  priority  system,  he  is  also  assured 
of  doing  under  a  fixed  priority  system. 

However,  the  following  theorem  can  be  proved  under  some  restrictive 
assumptions : 


Theorem  7: 

If  one  of  the  following  is  true: 

1)  There  are  only  2  jobs  (and  any  number  of  resources). 

2)  There  is  any  number  of  jobs,  but  comoetition  is  limited  to 
one  resource  only;  then  there  exists  a  priority  system  whose  attainable 
domain  includes  the  domain  defined  by: 


p.  .  <  p.  .  <=>  a.  .  <  a 

ii  kn  ii  in 

The  proof  is  shown  in  appendix  B. 

Theorems  6  and  7  show  that  a  fixed  priority  system  should,  to  a 
certain  extent,  be  preferred  to  a  random  Drioritv  system  (which  is  it¬ 
self  better  than  no  preemptibi lity  at  all).  TF  a  resource  has  the  pro¬ 
perty  that  it  can  be  preempted  without  anv  other  additional  future  loss 
of  time,  then  the  available  information  on  the  jobs  can  be  used  to 


assign  priorities  for  the  resources,  and  a  "good”  choice  is  to  assign 


the  resource  to  the  job  which  has  the  least  need  for  it  (after  havina 
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weighted  these  needs  by  the  external  urgencies  of  the  jobs,  which  leads 

a ,  . 

to  the  quantities  — . 

i 

III. 6  Problems  for  further  research. 

1)  Continuous  macroscheduling:  Instead  of  applving  the  macro¬ 
algorithm  at  regular  time  intervals,  find  a  simplified  macroalgorithm 
to  be  applied  each  time  a  job  previously  running  deactivates  itself 
voluntarily,  or  when  a  job  changes  its  external  prioritv,  or  even 
when  the  swapoing  channel  is  idle.  Jobs  might  be  scheduled  or  un¬ 
scheduled  just  using  the  desirabilities  which  have  already  been 
computed,  but  it  miqht  also  be  desirable  to  recompute  the  p  ^'s,  the 
K . ' s ,  the  d.'s  and  the  w.'s. 

2)  Extend  the  models  to  include  processes  using  more  than  one 
resource  at  a  time.  For  instance,  Fig.  I I 1-4  shows  the  virtual  time 
diagram  of  a  uses?  who  initiates  I/O  and  swapping  at  the  same  time: 


CPU 
I/D 
Swap* opt 
Swap- in 

memory  (20KJ 
memory  (2K) 


Resources 


Figure  III-4 


Virtual  time 


Another  characteristic  of  our  hypothetical  job  is  that  it  does 
not  need  all  its  memory  resource  continuously  (a  buffer  of  2K  is  enough 


80 


during  I/O  completion) .  Could  this  knowledge  he  taken  care  of? 

Solving  this  problem  would  be  especially  useful  for  future 
computer  systems  where  the  cost  of  arithmetic  and  control  units  is  ex¬ 
pected  to  decrease  much  more  than  the  cost  of  central  memories. 

3)  Find  models  of  "probable"  performance  as  well  as  "worst  case" 

ft 

models . 


4)  Which  information  other  than  a^.'s  or  the  b^.'s  on  the  jobs 


would  be  relevant  to  an  allocation  algorithm? 

For  instance,  the  exact  virtual  time  at  which  a  job  will  place  a 
request  might  be  available  for  some  jobs  while  being  completely  out  of 
the  question  for  others. 

5)  How  much  would  the  results  of  the  model  be  affected  by  slight 
errors  in  the  predictions? 


III. 7  Conclusion. 

My  initial  effort  was  applied  to  separate  problems  which  are 
usually  handled  together  in  a  very  intricate  manner:  1)  Scheduling: 

2)  Paging  algorithms;  3)  Deciding  external  priorities  of  users  4)  Col¬ 
lecting  information  about  the  average  probable  needs  for  resources  of  a 
soecific  job.  Pricing,  however,  should  not  be  a  question  separated  from 
scheduling.  The  problems  of  protection  and  of  doadlv  embrace  had  already 
been  separated  from  the  others  in  previous  works.  Ry  partitioning  the 
difficulty,  I  believe  that  the  way  to  bettor  scientific  understanding 
of  shared  computer  systems  stands  open. 

The  previous  scheduling  algorithms  and  models  apnlv  in  computer 
systems  where  the  shared  facilities  can  either  be  preempted  with  very 


little  overhead  (CPU,  busses  between  two  levels  of  fast  memory),  or 
cannot  be  reallocated  without  a  great  amount  of  overhead  (memorv) . 

The y  do  not  apnly,  however,  in  cases  whore  a  resource  can  be  preempted 
but  the  delay  imposed  on  the  preempted  job  is  greater  than  the  time 
during  which  the  preemption  occured.  This  would  be  the  case  if,  for 
in^stance,  a  job  is  swapped  from  the  drum  into  momorv,  but  if  at  a 
certain  moment  it  can't  get  one  of  the  cages  because  another  job  has  a 
higher  priority  to  get  a  page  from  this  sector  of  the  drum,  then  the 
preempted  job  will  have  to  wait  an  entire  revolution  of  the  drum  before 
the  opportunity  to  get  the  missing  page  is  reneatcd,  and  the  cost  of 
having  a  set  of  pages  idling  in  memory  during  all  that  time  is  of  course 
important.  In  such  a  case,  the  right  strategy  might  be  to  avoid  pre¬ 
emption,  and  to  decide  what  to  do  bv  computing  a  "desirabilitv  ratio" 
for  each  possible  scheduling  operation  (ratio  of  the  urqencv  bv  the  total 
cost  of  the  resources  involved).  (see  section  IV. 4). 

It  is  my  belief  that  the  scheduling  techniques  described  in  this 
chapter  will  be  especially  useful  for  scheduling  of  real-time  users,  who 
want  to  have  the  assurance  of  getting  a  certain  percentage  of  usage  of 
the  resources  of  the  machine  before  they  start  working. 

Other  investigations  of  multilevoled  schedulina  are  still  necessary. 
I  believe  that  queuing  theory  gets  enormously  comnlicated  too  ranidly 
when  the  number  of  servers  and  the  complexity  of  the  queue) no  strategy 
increase.  Simulation  is  a  fast  way  of  testing  whether  some  algorithm  is 
workable,  but  is  not  more  than  a  predictive  technique.  It  does  not  seem 
to  be  likely  in  the  future  that  a  scheduler  will  first  simulate  the 
situation  before  making  a  decision.  Analytical  approaches  are  almost  all 
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that  are  left  to  improve  schedulers  in  the  future  with  the  certainty 
that  the  designed  algorithm  will  wori'  almost  optimally  in  all  cases. 


CHAPTER  IV 


SWAPPING  ALGORITHMS 

In  this  chapter  a  study  is  made  of  swapping  algorithms  for  a 
computer  with  two  levels  of  memory:  drum  and  core.  Pre-paginq  takes 
place  before  a  program  uses  the  CPU,  an  entire  working  set  of  pages 
of  this  program  is  swapped  into  core.  Section  IV. 1  presents  the 
Berkeley  or  Van  Tuyl  algorithm,  which  was  developed  under  the  direc¬ 
tion  of  Butler  Lampson  at  Berkeley.  Then,  by  contrast,  another  swapp¬ 
ing  algorithm  is  presented  in  section  IV. 2.  I  then  explain  why  I 
think  that  the  latter  algorithm  is  much  more  appropriate  than  the 
former,  especially  for  future  computer  systems.  Resource  utilizations 
of  users  programs  under  both  algorithms  are  compared  in  section  IV. 3. 
Finally,  section  IV. 4  gives  some  indications  as  to  how  a  drum  to  core 
system  should  be  scheduled  if  the  swapping  algorithm  of  section  IV. 2 
is  used. 

The  various  notations  used  in  this  chapter  are  completely  inde¬ 
pendent  from  those  used  in  the  previous  chapters. 

I V . 1  The  Swapping  Algorithm  of  Van  Tuyl. 

llO]  describes  a  swapping  algorithm  between  drum  and  core  which 

19 

was  intended  for  the  BCC-1  computer.  The  system  has  essentially  four 
resources : 

19 

It  was  initially  designed  for  the  SCC  6700  computer. 
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a)  one  CPU 

b)  core  memory 

c)  drum  (capacity  supposed  to  be  infinite) 

d)  a  channel  between  core  and  drum 

A  program  might  be  in  four  possible  states: 

1)  on  the  drum, 

2)  being  brought  into  core, 

3)  in  core,  waiting  for  or  using  the  CPU,  or  waiting  for  an 
absent  page , 

4)  being  swapped  out  of  core,  to  the  drum. 

An  "external  scheduler"  decides  which  programs  are  candidates  to 
be  brought  into  core,  and  among  those  which  are  in  core,  which  one  gets 
the  CPU,  or  which  are  candidates  to  be  swapped  out. 

A  program  is  considered  to  be  in  core  when  a  certain  set  of  pages 
is  in  core  (this  set  might  be  the  entire  program) .  Programs  are  sup¬ 
posed  to  be  small  enough  so  that  their  pages  can  be  retrieved  entirely 
in  one  drum  rotation  (if  there  is  not  conflict).  A  conflict  occurs  if 
two  programs,  while  both  are  being  brought  in,  happen  to  have  a  page  on 
the  same  sector  of  the  drum. 

The  swapper  is  an  algorithm  which  has  to  decide,  at  each  sector 
of  the  drum,  which  page  should  bo  transferred.  It  might  do  either  a 
read,  or  a  write.  With  Van  Tuyl's  algorithm  (hereafter  called  "the 
Berkeley  algorithm"),  pages  which  are  not  dirty  (not  written  on  while 
in  core) ,  do  not  need  to  be  written  on  the  drum.  Van  Tuyl  simulated 
his  algorithm  with  the  assumption  that  halt  of  the  pages  cf  each  pro¬ 
gram  are  dirtied  while  in  core. 
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Table  IV- 1 

Decisions  of  the  Swapper  in  the  Berkeley  Algorithm 

1.  A  process  that  the  scheduler  has  decided  to  run  is  put  on  the 
swap  in  list  (or  road  list)  if  and  only  if: 

Pages  queued  in  +  pages  of  process  Lo  queue  in  -  free  core  <  f» 
(where  ?  i3  a  system  dependent  constant)  ,  and  panes  queued  out  «■ 
freo  slots  on  read  sector  list. 

2.  Drum  corsnand: 

a)  I_f  no  road  to  do  and  at  least  one  write  to  do  then  write 
out,  exit. 

b)  Compute,  if  conflict  (many  processes  requesting  to  read  a 
page  in  on  the  *ne«  sector),  for  ill  mowoncs  on  the  read  list: 
Coat  of  process  •  tiro  to  complete  reads  x  (i»PC) 

where  PC  *  number  of  page*  of  the  ptocess  In  core 
a  _>  *  pages  a  process  way  have  ( *  In  the  «,v*r  for  all 
process,  Ilfce  £  wan  previously). 

c)  l_f  there  is  4  free  page  in  core  then  read  *h#  page  ef  the 
lowest  cost  prnce*»,  exit. 

4)  tt  co*l  of  tea*!  •  5  or  no  pair-  cm  Le  i*«ln'  M  In  cote,  tjr»» 
da  a  «rlte  t<| *p  r^.rl  the  f*wte  o*  the  if»<e*t  e«*nt  pr*** c* » ,  rrit, 
).  S  pgg«  I  *  rel*<t***j  u 

In  cote  fr Irtrgivi  to  a  ffr»r«NK  f  ,  *im*»  that 

rml  lpf)  *  PH*  |«  jf 

C1*!  If%l  *«’•»! 

r  *  m 

■vhete  {>  I*  » V  ’“b® »•  tref  p»we*«*.  ♦*<)  (*  t  «*'*.  ff 
ftiili  !»•*  *«  eee,  in. 


\ 
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A  complete  description  of  the  swaoper  operations  is  given  in 

table  IV-1. 

Note  that  the  swapper  does  not  "look  ahead. "  When  the  drum  is 
positioned  at  a  certain  sector,  the  swapper  ignores  which  paqes  will 
be  candidate  to  be  swapped  in  on  later  sectors.  This  is  a  reasonable 
choico  because  the  algorithm  is  already  quite  complicated,  and  one  ray 
wonder  whether  all  the  decisions  of  table  IV-1  can  b.  made  by  a  micro- 
progr  mnod  processor  during  t’ao  tino  of  one  p.i.;p  read  (1  mill  isceond)  . 

IV. 2  Another  sv  ippinq  algorithm. 

The  main  iili.i  of  this  new  swapping  algor  it  in  is  to  swap  coni  x  gu- 
omly  in  tin*  the  pages  which  belong  to  the  Working  5.*t  of  a  program 
which  baa  to  bo  brought  In  or  out  of  cor**.  contrary  to  the  Berkeley 
algorithm,  all  of  the  |tage»  are  swapped  out  (anti  no*  only  the  dirty 
pages).  Thin  somewhat  Increase*  th«  channel  utt  lira!  ton,  but  intuit « 

In  big  saving*  in  *ew ,ry.  Tin.  reanon  for  then**  navlnnit  in  that  a  small 
p roo ran  can  r.uw  l*e  brought  Into  core  in  much  le«n  than  an  entlr  drum 
revolution  nlncc  Its  pagen  occury  contiguous  <%r-  icio  r*f  the  drum.  A 
program  la  t  ovn  to  be  wapped  m  because  n  ha**  a  high  rxtcrnal  pri¬ 
ority  an!  bffiuvi  its  i«i  of  l*  alicut  l*»  r«  tr*i  t lvc*  drun  heal*. 

To  avoid  l><  |  -  *»  st««t  llty  that  nv*h  a  istr-i'en*  *■  «|*l  Imbflmtclv  d*lay 
the  ni*air«t  of  n«*»*  Ur  *  imblrr  rust  >1  i  k  "in  niv me**  w’lteb 

v«vwlii  1*«  *  i5-  Hi  in  or  «Ht ini  a  ittw'  r*  v  -it**  fsee  r«*  i  ten 
IV.  41. 

Utd  this  al%eflf!m  tvaliVtv  If  M«*r  !l#lr«  tle  lid  g* 
thr  rv«£»r.  ;irt*  is  nnly  t«  ;#  ■%  t«  •  s  in  »•  a  tier*,  g»  on. 
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job  to  be  swapped  out.  As  long  as  the  swapping  (in  or  out)  of  the 
current  job  is  not  completely  finished,  there  is  no  possible  conflict, 
and  thus  the  next  page  to  bo  transferred  is  always  obvious.  When  the 
swapping  of  a  job  is  completed,  the  next  job  to  be  swapped  must  be 
selected.  It  can  be  any  job  in  core  if  a  swap-out  is  desired;  other¬ 
wise,  a  job  must  be  chosen  by  the  scheduler  to  be  swapped  in  such  that 
its  first  drum  sector  has  not  yet  arrived  at  the  read  heads.  When 
initiating  a  swap-in  operation,  there  must  be  an  assurance  that  enough 
free  coro  is  available.  The  problem  of  determining  which  job  will  bo 
swapped  in  or  out,  will  bo  studied  in  section  IV. 4. 

I V .  3  Comparison  of  the  resource  utilization  un ior  both  algorithms. 

To  simplify,  supj>ose  that  there  is  just  one  CPU  and  one  drum. 

Then,  there  are  tlireu  important  resources:  the  CPU,  the  core  memory, 
and  the  drum- to -core  channel.  The  drum  rreno-y  is  supposed  not  to  be 
saturated  under  normal  conditions.  The  utilisation  of  these  three 
resources  is  computed  by  a  proqram,  during  an  entire  cycle  (swap-in, 
compute,  swap-out).  Those  utilizations  will  be  normalized  in  tine- 
utilization  of  tho  entire  resource.  Tor  instance,  if  the  core  nem.ory 
size  is  the  use  of  an  amount  n  of  core  durina  time  t  is  normalized 

fp| 

to  a  memory  utilization  of  -  t. 

?! 

Khile  computing  the  resource  utilizations,  some  simplifying  assump¬ 
tions  are  made.  »ho  strong*”*!  ol  th*n  is  to  neoleet  the  increased  uti¬ 
lization  of  reeary  and  channel  due  to  conflict''  in  lh**  Herhelny  algor¬ 
ithm.  Removing  this  particular  assumption  sttenith«n*  th«  conclusions 


wMch  folio*  even  *,*3r© 
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The  following  definitions  will  aid  in  the  discussion: 

S  number  of  sectors  on  one  drum  revolution 

P 

P  number  of  pages  of  a  program  (  -  of  them  dirty) 

M  size  of  core  memory 

T  time  spent  by  the  CPU  on  a  program,  while  it  is  in  core 
S  and  P  both  have  a  time  dimension.  In  these  computations,  the 
time  unit  is  the  time  to  read  one  sector  from  the  drum  (1  millisecond 
on  the  BCC-1) .  The  resource  utilizations  are  shown  in  table  IV-2. 

The  channel  utilization  time  is  ^ P  in  the  first  algorithm,  com¬ 
pared  to  2P  in  the  second,  due  to  the  fact  that  the  dirty  pages  are  not 
swapped  out  in  the  first  algorithm. 

Memory  utilization  is  computed  as  a  space-time  product  (see  fiqure 
IV-2).  With  the  Berkeley  algorithm,  for  instance,  the  program  is 

brought  into  core  in  one  drum  revolution  (time  S)  ,  handled  in  time  T  by 

3 

the  CPU,  and  swapped  out  in  time  -  P  (in  the  first  phase  of  this  swap- 
out,  the  clean  pages,  which  do  not  need  to  be  swapped,  arc  replaced  by 

p 

another  program's  pages;  in  the  second  phase,  the  -  ilirtv  pages  are 
swapped,  at  a  rate  of  one  page  per  two  sectors,  the  other  sector  tine 
being  given  to  a  read).  The  total  resource  utilization  of  a  program  is 
new  defined  as  the  naxinum  of  the  throe  resource  utilizations  (channel, 
core  and  CPU) : 


For  algorithm  *1: 

t',  »  max 


ror  algorithm  *2; 

U  •  max  IT,  PT  »  2)  *  ,  ?H 
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The  Berkeley  algorithm  was  simulated  by  Van  Tuyl  under  various  assump¬ 
tions,  among  them,  P  =  10  pages,  S  =  32  milliseconds  (or  pages),  M  =  32 
pages  and  T  =  10  milliseconds.  Table  IV-2  shows  that  =  15  <  =  20 

with  these  data,  so  that  the  1st  algorithm  really  behaves  better  than 
the  second  algorithm,  and  the  bottleneck  really  lies  in  the  channel. 

If  the  memory  size  and  the  speed  of  the  channel  are  decreased  to  P  =  5, 
S  =  64,  M  =  10,  and  T  =  10 ,  the  second  algorithm  performs  much  better 
than  the  first  one:  =  23  >  =  10,  and  the  bottleneck  of  the  first 

algorithm  lies  in  the  memory. 

Now  follows  a  study  of  how  the  resource  utilization  would  change 
if  the  characteristics  cf  the  available  hardware  wore  to  chancie. 

The  size  of  the  memory,  M,  the  length  of  a  drum  revolution,  S, 
or  the  bandwidth  of  the  channel,  B  could  be  varied,  n  was  supposed  to 
be  equal  to  1  in  the  previous  computations.  More  nenerally: 

/  „  E2  \ 

/  _  2  0  B  ,  3  P  1 

Ul-maX^T'  - K -  2  b) 


max 


/  Pi  +  2  \ 


1)  Effoct.  of  bandwidth. 

Figure  IV-3  shown  the  effoct  of  bandwidth.  Tor  high  bandwidth, 
algorithm  *2  performs  better  than  algorithm  M,  as  expected.  Uoti 
that  this  is  not  true  if  T  were  high  (in  which  case  both  algorithms 
would  he  CPU  bound);  but  the  assumption  is  made  that  CPU's  are  netting 
faster  and  cheaper,  and  are  not  the  critical  resources  of  modern  com¬ 


puter  systems. 
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2)  Effect  of  drum  rotation  time. 

Figure  IV-4  shows  how,  when  the  drum  rotation  time  increases, 
algorithm  #1  looses  efficiency,  but  algorithm  #2  does  not  degrade  at 
all.  This  is  due  to  the  fact  that  the  memory  utilization  by  algorithm 
#2  is  independent  on  S.  This  will  allow  the  possible  use  of  slow, 
cheap  drums  in  future  computer  systems. 

3)  Effect  of  a  change  of  the  relative  cost  of  core  memory 
versus  cost  of  other  resources. 

In  figure  TV-5,  it  can  be  seen  that  if  the  core  memory  size  de¬ 
creases,  the  second  algorithm  does  not  get  memory  bound  as  rapidly  as 
the  first  one.  This  will  be  helpful  if  memory  is  the  critical  resource 
in  the  future. 

Demand  paging.  If  a  page  is  missing,  the  normal  strateqy  under 
algorithm  #1,  is  to  leave  the  program  in  core  while  the  page  is  being 
brought  in.  But,  with  algorithm  #2,  if  enouoh  bandwidth  is  available, 
it  is  cheaper  to  swap  the  entire  working  set  of  the  program  back  onto 
the  drum,  to  bring  the  missing  page  into  core,  and  then  to  swap  the 
working  sot  in  aaain  when  it  arrives  under  the  read  heads  of  the  drum. 
The  samo  considerations  apply  for  a  short  I/O  operation. 

A  final  word  is  necessary  about  the  accuracy  or  these  resource 
usage  estimates.  It  wos  assumed  that  there  would  be  no  eonflict:  in 
othor  words,  for  each  sector  of  the  drum,  there  in  zero  or  one  pane 
transferred,  but  never  pages  of  two  different  lobs  both  want  ins  to  be 
transferred.  Possible  conflicts  tend  to  increase  sometimes  considerably 
th«  resource  ussge  (memory  and  channel)  for  nleorith-s  ■!,  whose  actual 


on  total  resource  utilization  of  a  program 


on  total  resource  utilization  of  a  cronran 
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behaviour  can  be  much  more  resource  consuming  than  the  figures  show. 

This  would  lead  to  a  preference  for  algorithm  #2  even  more  than  was 
previously  computed. 

IV . 4  Scheduling  a  computer  system  under  the  algorithm  of  section  IV. 3. 

The  scheduling  problem  considered  here,  consists  of  deciding 
which  job,  at  a  given  time,  is  to  be  swapped  into  core,  out  of  core, 
or  to  occupy  the  CPU. 

IV . 4 . 1  Scheduling  Criterion. 

For  user  i,  we  suppose  to  be  known: 

1)  his  bid  C^ . 

2)  his  requested  CPU  utilization  tine  . 

C^  is  the  bid  for  an  entire  cycle  of  swap-in,  CPU  usage  during 
an  interval  T^ ,  and  swap-out. 

An  additional  constraint  is  that  a  job  can  only  be  swapped  in 

when  its  sectors  on  the  drum  pass  under  the  read  heads. 

The  system's  criterion  for  scheduling  is  to  maximize 

E  =  Z  C. 

ies 

where  S  is  a  set  of  users  which  can  be  run  over  a  given  time  interval. 

I V . 4 . 2  Jobs  Desirabilities. 

It  is  desirable  to  allocate  jobs  in  a  way  such  as  to  get  a  balance 
of  resource  usage  (to  swap,  for  instance,  a  larao-sized  job  while  a 
small-sized  job  with  a  large  T^  occupies  core) . 

To  get  such  a  balanced  set  of  jobs,  prices  are  first  assigned 


to  the  basic  resources: 


The  desirability  of  job  i  is  then: 


d  “  —  —  ■  '  ■'  .  . — 

1  a  p  ♦  n  p  4  n  l> 

CPU  CPU  Cl!  CM  MEM  Ml  .'•! 


where  ^cpu*  c*C)j,  cx^.j  nre  the  resource  utilisations  of  job  i,  which  are 
given  in  table  IV-2,  as  functions  of  the  characteristics  of  the  job. 


I V . 4 . 3  Job  scheduling  over  a  time - interval . 

No  justification  is  given  here  to  the  aloorihm  which  follows. 
However,  the  reader  will  rocoqnir.e  it  as  a  variation  of  on  alaorithm 
of  chapter  III. 

For  a  given  time-interval  ltJt  t2l ,  a  schedule  is  computed  l»y  the 
following  procedure.  As  nearly  as  possible,  jobs  are  allocated  in 
order  of  decreasing  desirabili ties.  A  job  can  be  scheduled  for  swapp¬ 
ing  if  its  pages  on  the  drum  do  not  share  any  common  sector  with  any 
job  already  scheduled  for  swapping,  and  if  the  jobs  for  which  alloca¬ 
tion  has  already  been  decided,  leave  enough  memory  for  the  new  job  to 
1)  be  brought  in,  2)  wait  for  the  CPU,  31  run,  4)  wait  for  the  channel, 
and  5)  swap  out. 

After  the  schedule  has  been  worked  out  for  this  time  interval, 
the  new  prices  of  resources  are  computed  as  a  function  of  the  old  prices 
and  the  idle  time  of  the  resources.  The  scheduling  algorithm  is  acti¬ 
vated  once  every  units  of  time.  This  minht  bo  typically  a 


drum  rotation  time. 
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ih*?  follovirat  dire*  fw»! 

1)  Nwn  IV  u**r  and  iV  **  !««>,  ii  vm^iI4  t< 

lM«mtlr/|  fO  rl-t-ilrtf)  A  UfHWOI*'  iSitrnw1  Which  I  wW  rv 

hit  nlxtiil  hit  «fMrw  tH«  I!  vrtiH  I  «*«*!**!  to  fV 

system's  resource  allocator).  This  laiwua*?*  v**uM  fvtrt*  tr*h»*«**fn 
between  the  Amount  of  Wf>*y  ih.i*  he  w,im«  re  *i«’hl  I'fi  IV-  rest  nv  V 
U  qoinu  so  net.  etc...  Menu  the.**  *aee  linen.  it  V  irtereetlm 

to  find  out  rore  about  single  ways  o'  aialvctni  tV  strurtftire  of  a 
proqran  and  its  characteristics  for  allocation. 

2)  Structuring  the  resource  allocation.  Users  wmt  rore  an  I  rorr 
cmplicated  contracts  with  tho  system,  whic'  sVuld  nuarantee  then  of 
qettinq  the  service  that  they  expect.  Wider  cl-mses  ,i'  *v>-.nihle  centric 


should  bo  investigated. 


** i*  rw  *'*  *  *<#«*#«#**  **  **#> 

******  *<*i»  **  5  to*  *#-♦« »!«.**  n#  *«*  r^H  f  *|f  M  |  #** 

|w  ***||*  (fit  #*>*"*  1 1  M»k‘  *» 


Ill 


MST  *rrtn'mf$ 


I.  ►  tt,  H  TWHNKiflW  Ow<«W,  T*"  ik«l<:*t 

►f«»***  m,  'I1**/*.  tf«T«  1  ***.  *•!♦'>.  $»«*|| 

llirnM't  IftHM. 

•  «  lAaMt’ft**  ft**#.  $■  I*1*  at  *?****»■  4**4! **1*.  *i  i-wtt 

♦f  im  *■  «  s;.i  U'4t*  m*t.  }?>•*•*. 

1.  'ftvst4  .  V»l*^  f«  «-l  4t  >  j|  *-4iy  1  f  >j»j  f4grf>-Ay|  |ftrf  >  «M>tt.  'Alt** 

wt  cttH*.  ttj,(  mi. 

4o  I'HtAtVj*?.  J  I  It  I  I  |#f  ,  yjtgy*»V  *,*  J  **  .  #■  Ml**** 

vw*|»'f.  *«•  JH  >  *«!», 

i.  t,****i*f,  l**A(*|  *•  1*1*^  ft*  I*  WW'^I 

f ♦&*♦*«  ..■*>  lH*'tnNM  <**«  **>.»!  til*  l»  I*»  *<  fM«  Ctowm- 

*2*  li*  u  itot.  I  ■*■*!.  *?**«*! 

M*-#  *** .  Maim*.  t*<«  §|  *,**•*«* •«.}-(»  !*-»•*  *r^ 

*«**♦*  A*.  *  *J»  t‘«*  •?  *M(*m*i  *,  #  i  f  *  CM  <■•.».  *«, 

<*H.  !*<•'*. 

?•  «>iw,  i.i.  ft  Wwf  a#  •tttvtiral  M*|  Iianiw  will, 

(■<w»f  Nrwff  I.  I  l-M,  m  u- 

*.  »1U  :  «M  '4-^,  f  l  i-#«  .<  »,lt  p  **  ***  **  <*  -  — 

™  ft*.  11.-  'Jw  »'  n.  i* 

*.  hl-*i  L  *  — *  I*.J.  t **■<*  •>*«-  *#|ft*t  if1. 

►rt*  -  ■  *  |}.»  M*» 

***  5*¥#l.  •*  ♦  *»  M  tlwi'M  »-*  V  •»  «*I*  *^»  t»r»  4*<w- 

Ift  «kt»»  tv#««  M*.  as*. I  ,  M*  nil*  •**!  • 

*<  *>■**<*  I  *>f. 

It.  &-it<r*).  X  f  »F  |*V  M  §*•)*  *  *  ♦  *  *M  f.  *%  *  l-Ai.  1 1*.*- 

**-Wf#<i**i  *>  *  •»".*ll  ^<IK  .  ,U)'  I*v5  •*  «g«i?  >  *#• 

Cwnf^s*#  »#♦•'»,  <sf%|,  jni,  '  .  W)iiM«t|liA|  tv  .. 

St*  i*1. 

tl>  tw*l*  *  f6  •«  .*18/***' ♦?  §*>**  !#s  *''*?!  I*  #"* 

.'r*' *•■•'•* ■!  .  5*»»l  *1*  **<  .  t.J.?..  t»f-  »!<»*♦.  ">•»*.. 

**W  |w* 

I}.  !>*••»  i^f.  f*t*f  *•  f$**  v'tll*--*  #  *  !«**  WiiFWt • 

;w.  y.  *  mv  mi.  «}*n>. 


► 
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14.  FonleiUol .  Iv4<«ri  is.  ao>1  Oro««Mun,  MrUn  4.  *r,  Analytic*!  n**l«l 

of  Nlnpnwrwwil  <s«of\itt«o.  Froc.  Artis  !*<**  *!*r  in  j  Joint 
COOfaiBf  O*»'MrnC0.  Vol,  14,  A**!?*  ftess,  fcn'VAin,  *n  717-721. 

V*,  ^W«l,  Rsnlf  |  I'lffN.  #ttoe*U*n  Ift  (Ww«l*r  ev*lr.*w, 

Hfcri  w».  tt*ll  01*. fi.  'Tli'«»«l,  crw^st^r  |w>  iti^wt. 

V.C.I..A..  Itift*  ITf** 

H.  Jfl#l**0,  U.  to  Ik  MlkAMW 

of  C*«VOrt«f f  hN1,  ATIfr.*  |W*  »  .»|  |  *«IM  ri'tv^itrt 

0,«ft»w«,  V(tl.  }),  0».,  P.C.,  i*<. 

iSI-ill. 

17.  fM»l.  *«Al»  9*  tlMl«t.  r«NU>*41  *11*0  «?# 

*42.  ft. f.7.  fr'tfiffrtrt  %X\ 

It.  hlktUM,  tv*o.  A  id  f»f  ♦*»  i ib*.  gm«  AW 

A  l^7»  4-4 }»4 #1 . 

17.  tftpfO.  ♦  *.  4*<1  <3^1  *»*«**,  C.  *n  »•» 

#  i'*tl  ♦»  .  !»»•'  I’M,*  **  tl*<»  .Viol  <  WAlri  f  **., 

N,  7ftwWIW»0  *I«WA  C*  *  ISMMIIW,  ft.r..  t«,  mt  HW. 

«$•  (higuHMt**  #\**rkll.  Pm0i**4  ffiAfl*!.*  10  |Bw.  f»W.  /OftO 

I ’Wo  JOIM  CWNI*»  y*|,  *),  fn-v-t  (|». . 

»-*»^lMW*.  t».C*.  K •  I*7II*I0S#. 

•  I.  IMM**.  •gyooy, »f  Vjl  h» -  JkMs  vi|*f  a  9  <*.■*.  <w  •»"#%.  | m, 

2J,  C%ff  «>**■•  •  4.9.,  v<4  1.  '  »(rtiNp»4»  *■»<* 

P«lf  #tnvr*fT«-  *»*♦••*.  f  t  -.  (Wt*S  I1*"*  <■  1 '  **  Mi«t  i  ■«***. 
C«*f*f*Bw*.  •*‘*"1.  }<•  9****  tf*..  -tu*t %!♦»** .to,  *m,  ||--*|. 

'  «*,  fc-nii  t.  »t  At.  o*  Urtllk'  f -f  Ik  0/1  •#•<*»  r’l*l*** 

irtllO)  li.i>  IM.  N*l.  *♦*■«», 

i<  ilo',,,n  *#'4  i*  4  *s»4  •&***,  •'*•*'14  %-  "H  *»♦*  »K(f  |*» 

Of  ftv>  Ollt  i  *  t|A  1  SMl  9'|  0M0*'*r4  Wr»  t-«h  »  *"  *  T  lb) 

'  ir-mt----  t  h*t  -  |*»t,  •  »  |WI 

27.  B|#|0*  /"I.  t-  ’  *  IB**  to  I'M***  f*t  *£*<0**  +*■'•  •  it  B  .-*  »  , 

0  til  |V  }4I.  It*. 

.*»«.  h'ltlf*.  M  O*  *.  7ft*  "•«**■  *,|lH‘rW 

f  #r  wMw-f  At  IS*  ftptrolos*  *0  *?M*t  im  <  ,**•■*  tf  i**l*  |*  *, 

*  1*1,  h 

/?.  **SM  Ihkft.  *7*  Owtk'fc*  of  a  OtflMlfiti  to*  *|« 

*W  1^.-4  tktll  l?‘ft  >*M|. 


► 

I 


Evsroit,  III.  e*n#ralt*ad  UqrAnoe  iwltipli^r  n^thod  for 

nolviivj  probte**  o *  opt  i*ur»  allcv-Mion  of  rosontc#*.  II.? 

(ftiy-Juo#  19/.3I,  ?>*-4P. 

ftevpo«.  fMvid  f.  Cn  owrcoMiw  hioS-prlor Uv  n,ir*lv#i'>  »n  milU- 
t»roqr*wti»M»  *y*i*wt.  a  ea***  hiitorv.  co*w.  frHt  11  ,ff  fA«i«u  l°Atl. 
5W-S41. 


AITW»J*  h 


* 


f*©»r  er 


IA-II 


(i  t  l*J)  C«  l'«  Vf  Into.  U^f  ►flf'tllV  jM&l’AWte'Al  #| 

n 


Vtgn  »i.  .itiwf  H  v,. 


V  *1 


^  I  tt  \  • 


v*l ht  ,,  *  L  «%•*  <ff  *■  *  -*M  V*»  *  I 


*w# 

Ifei**  ***|«**wirt  ♦;« 


A 


IWI 


|I«A|.  ^  ^*%S.*%*  #  •  W  *  *  * 


r  *  ,  *  »  4  #»*, 


t*.  *  j».' 

*  «s*# 


m4  *  I 


ll  ♦!‘  »SI  ft*  tf'ViM  l!W  .**  *+  <►#  >A*|»  I  **  |*r  *  *> 

•f  U’J),  «m*n  fV  A.  *  I*  «il  1ft, *ft  *»*  *»,*  >1*11 

*i  «f  Wf-JI. 

II  IW*  I'S*!  f*if  A.  *  i  4***  ft  #  1*.  it  1*  INN*!^**  rv#l  |v  .  *V  u  ,, 
N'fw«  **  I  «••*•  *•$*.*  ttlfi,}^  It  *»'|«-|  M*«  I  *«»  *• 

M‘»*  ft*?i  rf»*  ****1 » #  i«4  €*  **!#  #  T*  *  •'#■***■*  *•*?♦*»  i«*j  *"4  Af 

W*il  t%*  ft  *  »  l  #  i', 


»•  m  h*,  Y|(  •  <•„  *  V,.,.  *  l 

*  »*  i  '•  ‘Vi-*,,:1 

thUKI  V*  « 

5  1*4, •  _  iji^i  ffif  **1  iM«  tt m#*  iw  fi* 

Ft 

It  El  |M|  f|  |*  I^iirl 

tif  #t**l»<**  *5*  Ft  tA-lt.  rw  w  #  I  «*0‘  #  I*.  J'(i,  ^  4,  , 

V)  •  **„♦  ***».•,  •  fV, 


wtt  Dl<  * 


i 


* 


c*  'vmcnhp  }, 

Tr*  I*  t»|¥vf|  |A  iM**' 

I  *>**• »  4  41***!!  $*1*4 IV*  K".  *  ****#ll* 

IJW*  Pi4*lWljp'  #4  F  1*  *<H  4  *«»j  **»•>•*  ffc*  p#wff***  ♦*"  •*  *  »t  |*r'48 


v 


i 


r*l<li 


ttf  i*  ***»«  t*  Mf*w*  «miH  «>#  ♦*•*< 

M  *  t.  It  !•«*  ♦*  *nl**r'«  <*«»*  «'  >  V>l  *t‘<  ♦•*l*TrS***<t 

»4  *»m4i  4  ■v*r  W*t  !♦»  **«  f  „  *  4Al 


V  »4  11.4*1 


Vi#  IV-I.M 


s 

*  # » 


♦  I  4 


•  •  tt  l*'*  *«•  4#W«*  ,  | Sue  *■*«♦**■** 


M!»*  *t*  »*4"4  *•* 


v  ^  (I  *  «  I  It  *  *,  ,1 

*  If  r! 


► 


V  *  "  ‘V  "  '  *})’  "  *  V-i.j' 

V  .  i  *  • 


w  *  H 

* 


ll  «»))  *<***  If  lw#  i***  ♦  *  |  »’h>#  |  ■  |  $  |  inf 

V*  v».j  ■  '•  •■«••** 

ii  >  *!•».> 


nv*  0«  #*»«*:* «-  <***t #«)*«>*«  f  *.».  *■*  i»i  f«  ««.|  *>#  •+. 

f  ir*  f*M*Mi*»  «-*  |  i  *  t  f  **  •»*..*»#  «*  ». 

n»««  ?|f  *n  Hl’t)!1*  »»«"W«nU ,  "'»  ff  f|*.e 

t<*»*n  fta*#  «*♦»  m 

**•  ‘  Vi  . . 1*1  ’  ’..i  Vi  -  •—  ’  V* 

■.#*<$  «i»  *»<.:<«*>*($ 

*  ’*i*i . ’'.*1  -  *\.i  *.-i  . . V. 


3K^  #» V  nt»|$> 


**  W«»*  ; 


%  *  **%  V*  *  I  I  *  Ml 


t*  *  C  It*  *  I  a  #  i»  «  w*  | 

m*  *i*i  »'  **.»  r*|.|  i-r 


.  «»•.„«  ..  M.V|)I  i»,ii  •  , t)H 


■  *..l  *  "I 


M»  IMl 


£  *  t*  ***  f  4 

«***  i 


.-.-I.:  ’  Vt  V*  Ji 

«* 

ft  *  ■  -  £ 


* 

f 


♦  *  t 
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3.  Haw,  obviously,  *ny  priority  fesslewMjnt  different  froa  th«* 
Of*lim.4l  priority  *6B»«MWw»ni  of  th*nre«  3,  is  such  that  thor»*  exist 
tv  u* *r*  i  fend  i»l  such  thoti 

‘  1 

41*1  >  4f  1  .1 

Ci  «i.| 

Th'f*  *w  priority  siUsimt  rven-trlvally  different  fro«  the 
m*  of  t**#r*n  1  c*a  I’1*  1^  fou*4. 


APPENDIX  C 


PROOF  OF  THEORE!  7. 


1)  There  are  only  2  jobs. 

Before  proving  the  theorem,  the  following  lemma  will  be  proved: 

Lemma :  if  ^21'  Wl  ant^  W2  are  Posit’-ve  numbers  less  than 

1,  then  equations 

(C-l)  (1  +  \12)  +  X21  w2  1  1 


(02) 

imply 

(03) 


Au  "[Ml*  \21>  w2  <  1 


“l  *  1  ^12  *  A21>  »jil 


Proof;  equation  (03)  is  achieved  bv  multiplying  equation  (C-l) 

by  (1  -  \  12)  ,  equation  (02)  by  X^,  and  adding. 

Proof  of  the  theorem:  Assume  that  V  i,  i  ¥  k  <=>  a  t  a  .  It 

13  kj 

mutt  be  shown  that  equations  (ITI-10)  imply  equations  (04)  : 


(04) 


■f  z 


liquations  (II I- 10)  may  bo  rewritten  as  equations  (C-l)  and  (c-2) 


Ill 


According  to  the  lemma,  this  implies  equation  (C-3) ;  now  note 

that 

Za , .  <  \  .  +  \  .  . 

lk  -  ''  i]  ''  31 

k 

a.,  <  a .. 

ik  ^  ]k 

so  that  equations  (C-4)  are  verified. 

2)  There  is  only  one  resource. 

It  must  be  shown  that  equations  (C-5)  imply  equations  (C-6) : 

(  C-5  )  Vi6[l,n]  ,  w  +  X  min  ^  ai  wi  '  \  wk  ^  1 

kj^i 

(  C-6  )  Vi  t  [l,n]  ,  wi  +  \  \  4  1 

a,  <  a. 
k  1 

Consider  the  set  S  of  jobs  such  that  kCS  <=>  a^^  >  a^w^. 

Does  there  exist  a  job  k'e  S  such  that  w^ ,  >  w^? 
a)  yes,  there  does.  Then  choose  k'  such  that  there  is  no  job 
in  S  whose  progress  ratio  is  less  than  w^ ,  and  greater  than  w^.  Then 
equation  #k'  of  (C-5)  implies  equation  #i  of  (C-6). 

b)  there  is  no  such  k'.  This  means  thatVkeS,  w  <w  ,  and 

K  X 

^>8^.  Thus  equation  #i  of  (C-5)  implies  equation  #i  of  (C-6). 

3)  Note  that  the  theorem  is  not  valid  for  any  number  of  jobs  and 
any  number  of  resources,  as  shown  by  the  following  counter-example: 


r 

* 


; 
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equations  (ill-10) ,  but  not  equations(III-2) . 

However,  I  suspect  that  the  ontimum  of  w  is  alwavs  hiqher 

i 

with  equationS(lIl-2)  than  with  the  constraints  of  enuations (III-10) . 


